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Abstract 

It is shown that a Banach space admits an equivalent norm whose modulus of uniform con- 
vexity has power- type p if and only if it is Markov p-convex. Counterexamples are constructed to 
natural questions related to isomorphic uniform convexity of metric spaces, showing in particular 
that tree metrics fail to have the dichotomy property. 



Contents 

1 Introduction 

1.1 The nonexistence of a metric dichotomy for trees 



6.1 Horizontally contracted trees 

6.2 Geometry of -ff-trees 

6.2.1 Classification of approximate midpoints 

6.2.2 Classification of approximate forks . . . 

6.2.3 Classification of approximate 3-paths . . 

6.3 Nonembeddability of vertically faithful B4 . . . 

6.4 Nonembeddability of binary trees 



1.1.1 Overview of the proofs of Theorem 11.101 and Theorem 11.121 

2 Markov p-convexity and p-convexity coincide 

3 A doubling space which is not Markov p-convex for any p E (0, 00) Il5l 

4 Lipschitz quotients [l7 

5 A dichotomy theorem for vertically faithful embeddings of trees [l8| 

6 Tree metrics do not have the dichotomy property |21 



21 

23 

23 
30 
36 
42 
44 



7 Discussion and open problems |45 



'Supported by ISF grant 221/07, BSF grant 2006009, and a gift from Cisco research center. 

f Supported by NSF grants CCF-0635078 and CCF-0832795, BSF grant 2006009, and the Packard Foundation. 



1 



1 Introduction 



A Banach space (X, || • \\x) is said to be finitely representable in a Banach space (Y, || ■ ||y) if there 
exists a constant D < oo such that for every finite dimensional linear subspace F C X there is a 
linear operator T : i 7 — >• Y satisfying ||x||x ^ ll^^lly ^ for all x £ F. In 1976 Ribe |31j 

proved that if two Banach spaces X and Y are uniformly homeomorphic, i.e., there is a bijection 
/ : X — > Y such that / and / _1 are uniformly continuous, then X is finitely representable in Y 
and vice versa. This remarkable theorem motivated what is known today as the "Ribe program" : 
the search for purely metric reformulations of basic linear concepts and invariants from the local 
theory of Banach spaces. This research program was put forth by Bourgain in 1986 [5j. 

Since its inception, the Ribe program attracted the work of many mathematicians, and led to 
the development of several satisfactory metric theories that extend important concepts and results 
of Banach space theory; see the introduction of [24] for a historical discussion. So far, progress 
on the Ribe program has come hand-in-hand with applications to metric geometry, group theory, 
functional analysis, and computer science. The present paper contains further progress in this 
direction: we obtain a metric characterization of p-convexity in Banach spaces, derive some of 
its metric consequences, and construct unexpected counter-examples which indicate that further 
progress on the Ribe program can uncover nonlinear phenomena that are markedly different from 
their Banach space counterparts. In doing so, we answer questions posed by Lee-Naor-Peres and 
Fefferman, and improve a theorem of Bates, Johnson, Lindenstrauss, Preiss and Schechtman. These 
results, which will be explained in detail below, were announced in [23j . 

For p ^ 2, a Banach space (X, \\ ■ \\x) is said to be p-convex if there exists a norm ||| • ||| which is 
equivalent to || • \\x (i-e., for some a, b > 0, a||x||x ^ ^ for all x G X), and a constant 

K > satisfying: 

< I - K\\\x - y\f . (1) 

X is called super reflexive if it is p-convex for some p ^ 2 (historically, this is not the original 
definition of superrefiexivitj0, but it is equivalent to it due to a deep theorem of Pisier [29] , which 
builds on important work of James [10] and Enflo [7]). For concreteness, we recall (see, e.g., [2]) 
that L p is 2-convex for p £ (1,2] and p-convex for p € [2, oo). 

Ribe's theorem implies that p-convexity, and hence also superreflexivity, is preserved under 
uniform homeomorphisms. The first major success of the Ribe program is a famous theorem of 
Bourgain [5] which obtains a metrical characterization of superreflexivity as follows. 

Theorem 1.1 (Bourgain's metrical characterization of superreflexivity [5J). Let B n be the complete 
unweighted binary tree of depth n, equipped with the natural graph-theoretical metric. Then a 
Banach space X is superreflexive if and only if 

lim c x (B n ) = oo. (2) 

n— >oo 

Here, and in what follows, given two metric spaces (^#, d^), (>yV , d yy), the parameter cjt{y¥) 
denotes the smallest bi-Lipschitz distortion with which jY embeds into i.e., the infimum of 

1 James' original definition of superreflexivity is that a Banach space X is superreflexive if "its local structure forces 
reflexivity" , i.e., if every Banach space Y that is finitely representable in X must be reflexive. Enflo's renorming 
theorem states that superreflexivity is equivalent to having an equivalent norm ||| ■ ||| that is uniformly convex, i.e., 
for every e £ (0, 1) there exists 8 > such that if 1 1 \x\ \ | = 1 1 \y\ 1 1 — 1 and |||a; — y\\\ = e then | \x + y\ \ ^2 — 5. 



X 



x + y 
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those D > such that there exists a scaling factor r > and a mapping / : ,JV — > ^ satisfying 
rdjyix^y) ^ d^(x,y) ^ Drd^(x,y) for all x,y € JY (if no such / exists then set cj?{jV) = oo). 

Bourgain's theorem characterizes superrefiexivity of Banach spaces in terms of their metric 
structure, but it leaves open the characterization of p-convexity. The notion of p-convexity is 
crucial for many applications in Banach space theory and metric geometry, and it turns out that 
the completion of the Ribe program for p-convexity requires significant additional work beyond 
Bourgain's superrefiexivity theorem. As a first step in this direction, Lee, Naor and Peres [16] 
defined a bi-Lipschitz invariant of metric spaces called Markov convexity, which is motivated by 
Ball's notion of Markov type [lj and Bourgain's argument in [5]. 

Definition 1.2 (|16j). Let {Xt}t^z be a Markov chain on a state space £1. Given an integer k ^ 0, 
we denote by {Xt(k)}t£z the process which equals Xt for time t ^ k, and evolves independently 
(with respect to the same transition probabilities) for time t > k. Fix p > 0. A metric space 
(X,dx) is called Markov p-convex with constant II if for every Markov chain {Xt}tez on a state 
space Q, and every f : — > X, 

~ E\d x (f(X t )j(x t {t-2 k ))Y} 

EE^^ ^ — < nP • E E [^(/(^)'/( x *-i)) p ]- ( 3 ) 

k=o t&z tez 

The least constant II for which Q holds for all Markov chains is called the Markov p-convexity 
constant of X, and is denoted H p (X). We shall say that (X, dx) is Markov p-convex ifU p (X) < oo. 

To gain intuition for Definition ll.2[ consider the standard downward random walk starting from 
the root of the binary tree B n (with absorbing states at the leaves). For an arbitrary mapping / 
from B n to a metric space (X, dx ) , the triangle inequality implies that for each fc£Nwe have 

E \d x (f(X t ),f(x t (t - 2 k ))Y] 
E^^ 2^ J -^< p ^[dx(f(X t )J(X t ^))% (4) 

tez tez 

with asymptotic equality (up to constants depending only on p) for k ^ ^rp when X = B n and / 
is the identity mapping. On the other hand, if X is a Markov p-convex space then the sum over k of 
the left-hand side of (j4]) is uniformly bounded by the right-hand side of ([!]), and therefore Markov 
p-convex spaces cannot contain B n with distortion uniformly bounded in n. 

We refer to [16] for more information on the notion of Markov p-convexity. In particular, it 
is shown in [16] that the Markov 2-convexity constant of an arbitrary weighted tree T is, up to 
constant factors, the Euclidean distortion of T. We refer to [16J for L p versions of this statement 
and their algorithmic applications. It was also shown in |16j . via a modification of an argument of 
Bourgain [5], that if a Banach space X is p-convex then it is also Markov p-convex. It was asked 
in [16] if the converse is also true. Here we answer this question positively: 

Theorem 1.3. A Banach space is p-convex if and only if it is Markov p-convex. 

Thus Markov p-convexity is equivalent to p-convexity in Banach spaces, completing the Ribe 
program in this case. Our proof of Theorem 11.31 is based on a renorming method of Pisier [29]. It 
can be viewed as a nonlinear variant of Pisier's argument, and several subtle changes are required 
in order to adapt it to a nonlinear condition such as (J3]). 
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Results similar to Theorem 11.31 have been obtained for the notions of type and cotype of Banach 
spaces (see [61 [301 [Tj ES, EH [22] ) , and have been used to transfer some of the linear theory to the 
setting of general metric spaces. This led to several applications to problems in metric geometry. 
Apart from the applications of Markov p-convexity that were obtained in [16] , here we show that this 
invariant is preserved under Lipschitz quotients. The notion of Lipschitz quotient was introduced 
by Gromov [8] Sec. 1.25]. Given two metric spaces (X, dx) and (Y,dy), a surjective mapping 
/ : X — > Y is called a Lipschitz quotient if it is Lipschitz, and it is also "Lipschitzly open" in the 
sense that there exists a constant c > such that for every i£l and r > 0, 



Here we show the following result: 

Theorem 1.4. If (X,dx) is Markov p-convex and (Y,dy) is a Lipschitz quotient of X, then Y is 
also Markov p-convex. 

In [3] Bates, Johnson, Lindenstrauss, Preiss and Schechtman investigated in detail Lipschitz 
quotients of Banach spaces. Their results imply that if 2 ^ p < q then L q is not a Lipschitz 
quotient of L p . Since Lp is p-convex, it is also Markov p-convex. Hence also all of its subsets are 
Markov p convex. But, L q is not p-convex, so we deduce that L q is not a Lipschitz quotient of any 
subset of L p . Thus our new "invariant approach" to the above result of [3] significantly extends it. 
Note that the method of [3] is based on a differentiation argument, and hence it crucially relies on 
the fact that the Lipschitz quotient mapping is defined on all of L p and not just on an arbitrary 
subset of L p . 

In light of Theorem 1 1.31 it is natural to ask if Bourgain's characterization of superreflexivity holds 
for general metric spaces. Namely, is it true that for any metric space X, if lim n ^ cx(B n ) = oo 
then X is Markov p-convex for some p < oo? This question was asked in |16j . Here we show that 
the answer is negative: 

Theorem 1.5. There exists a metric space (X,dx) which is not Markov p-convex for any p € 
(0,oo) ; yet liuin^oo cx(B n ) = oo. In fact, (X,dx) can be a doubling metric space, and hence 
cx(B n ) ^ 2 Kn for some constant k > 0. 

Theorem 11.51 is in sharp contrast to the previously established metric characterizations of the 
linear notions of type and cotype. Specifically, it was shown by Bourgain, Milman and Wolfson [6] 
that any metric space with no nontrivial metric type must contain the Hamming cubes ({0, l} n , ||-||i) 
with distortion independent of n. An analogous result was obtained in |24j for metric spaces with 
no nontrivial metric cotype, with the Hamming cube replaced by the £oq grid ({1, . . . , m} n , \\ ■ ||oo)- 

Our proof of Theorem 11.51 is based on an analysis of the behavior of a certain Markov chain on 
the Laakso graphs: a sequence of combinatorial graphs whose definition is recalled in Section [3l As 
a consequence of this analysis, we obtain the following distortion lower bound: 

Theorem 1.6. For any p ^ 2, the Laakso graph of cardinality n incurs distortion fi((log n) 1 ^) in 
any embedding into a p-convex Banach space. 

Thus, in particular, for p > 2 the n-point Laakso graph incurs distortion f2((log n) 1//p ) in any 
embedding into L p . The case of L p embeddings of the Laakso graphs when 1 < p ^ 2 was already 
solved in [261 021 EH [H] using the uniform 2-convexity property of L p . But, these proofs rely 




(5) 
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crucially on 2-convexity and do not extend to the case of p-convexity when p > 2. Subsequent to 
the publication of our proof of Theorem 11.61 in the announcement [23] , an alternative proof of this 
fact was recently discovered by Johnson and Schechtman in [llj. 

1.1 The nonexistence of a metric dichotomy for trees 

Bourgain's metrical characterization of superreflexivity yields the following statement: 

Theorem 1.7 (Bourgain's tree dichotomy |5j). For any Banach space (X, \\-\\x) one of the following 
two dichotomic possibilities must hold true: 

• either for all n £ N we have cx{B n ) = 1, 

• or there exists a = ax > such that for all n G N we have cx(B n ) ^ (logn) a . 

Thus, there is a gap in the possible rates of growth of the sequence {cx(B n )}^ =1 when X is 
a Banach space; consequently, if we were told that, say, cx(B n ) = O(loglogn), then we would 
immediately deduce that actually cx{B n ) = 1 for all n. Additional gap results of this type are 
known due to the theory of nonlinear type and cotype: 

Theorem 1.8 (Bourgain-Milman-Wolfson cube dichotomy [6]). For any metric space (X,dx) one 
of the following two dichotomic possibilities must hold true: 

• either for all n € N we have cx ({0, l} n , || • ||i) = 1, 

• or there exists a = ax > such that for all n E N we have cx ({0, l} n , || • ||i) n a . 

Theorem 11.81 is a metric analogue of Pisier's characterization [28J of Banach spaces with trivial 
Rademacher type. A metric analogue of the Maurey-Pisier characterization [20] of Banach spaces 
with finite Rademacher cotype yields the following dichotomy result for grids: 

Theorem 1.9 (Grid dichotomy [23]). For any metric space (X,dx) one of the following two 
dichotomic possibilities must hold true: 

• either for all n 6 N we have cx ({0, . . . , n} n , \\ ■ ||oo) = 1, 

• or there exists a = ax > such that for all n G N we have cx ({0, . . . , n} n , || • ||oo) ^ n a . 

We refer to the survey article [21] for more information on the theory of metric dichotomies. 

Note that Theorem 11.71 is stated for Banach spaces, while Theorem 11.81 and Theorem 11.91 hold 
for general metric spaces. One might expect that as in the case of previous progress on Ribe's 
program, a metric theory of p-convexity would result in a proof that Theorem 1 1 . 71 holds when X is 
a general metric space. Surprisingly, we show here that this is not true: 

Theorem 1.10. There exists a universal constant C > with the following property. Assume that 
{s(n)}^ =0 C [4, oo) is a nondecreasing sequence such that {n/s(n)}^ =0 is also nondecreasing. Then 
there exists a metric space (X,dx) satisfying for all n ^ 2, 



s 



40s(n) 



n 




(6) 



Thus, assuming that s(n) = o 




logiogw j ' ^ iere exists a subsequence {ni t }'^L 1 for which 
(l-o(l))s{n k )^cx(B nk ) <s(n fc ). 



(7) 
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Theorem 11.101 shows that unlike the case of Banach spaces, for general metric spaces, cx(B n ) 
can have an arbitrarily slow growth rate. 

Bourgain, Milman and Wolfson also obtained in [6j the following finitary version of Theorem ll.81 

Theorem 1.11 (Local rigidity of Hamming cubes [6]). For every e > 0, D > 1 and n £ N there 
exists m = m(e, D, n) € N such that 

lim m(e,D,n) = oo, 

n— >oo 

and for every metric d on {0, l} n which is bi-Lipschitz with distortion ^ D to the l\ (Hamming) 
metric, 

c ({ o,i}M) ({0, II • 111) < l + e. 

We refer to [6] (see also |30j ) for bounds on m(e,D,n). Informally, Theorem 1 1 . 1 1 1 savs that the 
Hamming cube ({0, l} n , || • ||i) is locally rigid in the following sense: it is impossible to distort the 
Hamming metric on a sufficiently large hypercube without the resulting metric space containing 
a hardly distorted copy of an arbitrarily large Hamming cube. Stated in this way, Theorem 11.111 
is a metric version of James' theorem [9] that l\ is not a distortable space. The analogue of 
Theorem 11.111 with the Hamming cube replaced by the grid ({0, . . . ,n} n , || • \ ^) is Matousek's 
BD-Ramsey theorem |19j : see |24j for quantitative results of this type in the case. The following 
variant of Theorem II .101 shows that a local rigidity statement as above fails to hold true for binary 
trees; it can also be viewed as a negative solution of the distortion problem for the infinite binary 
tree (see [27] and [H Ch. 13, 14] for more information on the distortion problem for Banach spaces). 

Theorem 1.12. Let be the complete unweighted infinite binary tree. For every D ^ 4 there 
exists a metric d on that is D -equivalent to the original shortest-path metric on B^, yet for 
every e € (0, 1) and m S N, 

C(iW)OBm) < D - e m < D CL)2 l e . 

The local rigidity problem for binary trees was studied by several mathematicians. In partic- 
ular, C. Fefferman asked (private communication, 2005) whether {-B n }^i have the local rigidity 
property, and Theorem 11.121 answers this question negatively. Fefferman also proved a partial local 
rigidity result which is a non-quantitative variant of Theorem 11.141 below (see also Section [5]) . We 
are very grateful to C. Fefferman for asking us the question that led to the counter-examples of 
Theorem 1 1 . 1 01 and Theorem ll.121 for sharing with us his partial positive results, and for encouraging 
us to work on these questions. M. Gromov also investigated the local rigidity problem for binary 
trees, and proved (via different methods) non-quantitative partial positive results in the spirit of 
Theorem 11.141 We thank M. Gromov for sharing with us his unpublished work on this topic. 

The results of Theorem II . 101 and Theorem 1 1 . 1 21 are quite unexpected. Unfortunately, their proofs 
are delicate and lengthy, and as such constitute the most involved part of this article. In order to 
facilitate the understanding of these constructions, we end the introduction with an overview of 
the main geometric ideas that are used in their proofs. This is done in Section II. 1.11 below — we 
recommend reading this section first before delving into the technical details presented in Section [6l 



6 



1.1.1 Overview of the proofs of Theorem 11.101 and Theorem 11.121 



For x G Boo let h(x) be its depth, i.e., its distance from the root. Also, for x,y G Boo let \ca(x,y) 
denote their least common ancestor. The tree metric on B^ is then given by: 

dB^ixiV) = h(x) + h(y) - 2h(\ca(x,y)). 

The metric space X of Theorem 11.101 will be B^ as a set, with a new metric defined as follows. 
Given a sequence e = {e n }^L C (0, 1] we define d £ : Boo x B^ — > [0, oo) by 

d £ (x,y) = \h(y) - h(x)\ + 2e m - m{h(x)My)} ■ [mm{h(x) , h(y)} - h(\ca(x,y))} . 

d £ does not necessarily satisfy the triangle inequality, but under some simple conditions on the 
sequence {e n }^ =0 it does become a metric on Boo; see Lemma loTTl A pictorial description of the 
metric d e is contained in Figured) Note that when e n = 1 for all n, we have d £ = ds^- Below we 
call the metric spaces (Boo,d £ ) horizontally distorted trees, or H-tiees, in short. 




Figure 1: The metric d £ defined on Boo- The arrows indicate horizontal contraction by £h(x)- 

The metric space (X, dx) of Theorem 11.101 will be (Boo,d £ ), where e n = \/s{n) for all n. The 
identity mapping of B n into the top n- levels of -Boo has distortion at most s(n), and therefore 
cx(B n ) ^ s(n). The challenge is to prove the lower bound on cx(B n ) in ([6]). Our initial approach 
to lower-bounding cx(B n ) was Matousek's metric differentiation proof [18] of asymptotically sharp 
distortion lower bounds for embeddings of B n into uniformly convex Banach spaces. 

Following Matousek's terminology [18], for 5 > a quadruple of points (x,y,z,w) in a met- 
ric space (X, dx) is called a <5-fork if y G Mid(x,z, 5) n Mid (a;, w, 5), where for a, b G X the 
set of (5-approximate midpoints Mid(a, b,S) C X is defined as the set of all w G X satisfying 
max{dx (x , y) , dx (y , z)} ^ ijj^ • dx(x,z). The points z,w; will be called below the prongs of the 
(5-fork (x,y,z,w). Matousek starts with the observation that if X is a uniformly convex Banach 
space then in any (5-fork in X the distance between the prongs must be much smaller (as 6 — > 0) 
than dx(x,y). Matousek then shows that for all D > 0, any distortion D embedding of B n into X 
must map some 0-fork in B n to a 5-fork in X, provided n is large enough (as a function of D and 
5). This reasoning immediately implies that cx(B n ) must be large when X is & uniformly convex 
Banach space, and a clever argument of Matousek in [18] turns this qualitative argument into sharp 
quantitative bounds. 
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Of course, we cannot hope to use the above argument of Matousek in order to prove Theo- 
rem 1 1,10"! since Bourgain's tree dichotomy theorem (Theorem ll.7p does hold true for Banach spaces. 
But, perhaps we can mimic this uniform convexity argument for other target metric spaces? On 
the face of it, H-trees are ideally suited for this purpose, since the horizontal contractions that we 
introduced shrink distances between the prongs of canonical forks (call (x, y, z, w) E a canonical 
fork if x is an ancestor of y and z, w are descendants of y at depth h(x) + 2(h(y) — h(x))). It is for 
this reason exactly that we defined -ff-trees. 

Unfortunately, the situation isn't so simple. It turns out that H-trees do not behave like 
uniformly convex Banach spaces in terms of the prong-contractions that they impose of 5- forks, 
.ff-trees can even contain larger problematic configurations that have several undistorted 5- forks; 
such an example is depicted in Figure 




Figure 2: The metric space on the right is the H-tree (B^, d £ ), where e n = 5 for all n. The picture 
describes an embedding of the tree on the left (B^ minus 4 leaves) into (-E>oo,<i e ) with distortion at 
most 6, yet all ancestor/ descendant distances are distorted by at most 1 + 0(8). 



Thus, in order to prove Theorem 11.101 it does not suffice to use Matousek's argument that a 
bi-Lipschitz embedding of a large enough B n must send some 0-fork to a (5-fork. But, it turns out 
that this argument applies not only to forks, but also to larger configurations. 

Definition 1.13. Let (T,dx) be a tree with root r, and let (X,dx) be a metric space. A mapping 
f : T — )• X is called a D-vertically faithful embedding if there exists a (scaling factor) A > 
satisfying for any x,y € T such that x is an ancestor of y, 

Xd T (x,y) ^ d x (f(x),f(y)) < DXd T (x,y). (8) 

JV between metric spaces d^) and 



Recall that the distortion of a mapping <f> : ^# 
(jV,d^) is defined as 



dist(» 



clef 



sup 



d^r(<t>{x),<t>{y) ) 
'« dj?(x,y) 



. x, v e.* d^((j)(x),(j)(y)) 



€ [l,oo]. 



With this terminology, we can state the following crucial result. 
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Theorem 1.14. There exists a universal constant c > with the following property. Fix an integer 
t ^ 2, 5, £ G (0, 1), and D 2, and assume that n £ N satisfies 

n ;> ]_ D c(t\o g t)/& _ ( 9 ) 

Let (X,dx) be a metric space and f : B n — >• X a D -vertically faithful embedding. Then there exists 
a mapping <f> : Bt —> B n with the following properties. 

• If x,y G Bt are such that x is an ancestor of y, then 4>(x) is an ancestor of <p(y). 

• dist(» < 1 + f ■ 

• The mapping f o <f> ; B t — > X is a (1 + 5) -vertically faithful embedding of B t in X. 

Theorem 11.141 is essentially due to Matousek |18j . Matousek actually proved this statement 
only for t = 2, since this is all that he needed in order to analyze forks. But, his proof extends in 
a straightforward way to any t € N. Since we will use this assertion with larger t, for the sake of 
completeness we reprove it, in a somewhat different way, in Section [5j Note that Theorem 11.141 savs 
that {B n } c £' =l do have a local rigidity property with respect to vertically faithfully embeddings. 

We solve the problem created by the existence of configurations as those depicted in Figure [5] by 
studying (1 + 5)-vertically faithful embeddings of B^, and arguing that they must contain a large 
contracted pair of points. This claim, formalized in Lemma 16.271 is proved in Sections 16.21 16.31 

We begin in Section P6 . 2 . 11 with studying how the metric P2 (3-point path) can be approximately 
embedded in (Boo^de). We find that there are essentially only two ways to embed it in (Bo^de), 
as depicted in Figure [3J We then proceed in Section T6.2.2I to study 5- forks in (i?oo,(i £ ). Since 
forks are formed by "stitching" two approximate P2 metrics along a common edge (the handle), we 
can limit the "search space" using the results of Section 16.2.11 We find that there are six possible 
types of different approximate forks in (Bo^de), only four of which (depicted in Figured]) do not 
have highly contracted prongs. Complete binary trees, and in particular B4, are composed of forks 
stitched together, handle to prong. In order to study handle-to-prong stitching, we investigate in 
Section [6.2.31 how the metric P3 (4-point path) can be approximately embedded in (Soo,^). This 
is again done by studying how two P2 metrics can be stitched together, this time bottom edge to 
top edge. We find that there are only three different approximate configurations of P4 in (B^jds). 

Using the machinery described above, we study in Section 16.31 how the different types of forks 
can be stitched together in embeddings of B^ into (-Boo d e ), reaching the conclusion that a large con- 
traction is unavoidable, and thus completing the proof of Lemma [6. 2 71 The proofs of Theorem ll.101 
and Theorem 1 1 . 1 2 1 are concluded in Section [6.41 

2 Markov j>convexity and p-convexity coincide 

In this section we prove Theorem 11.31 i.e., that for Banach spaces p-convexity and Markov p- 
convexity are the same properties. We first show that p-convexity implies Markov p-convexity, 
and in fact it implies a stronger inequality that is stated in Proposition 12.11 below. The slightly 
weaker assertion that p-convexity implies Markov p-convexity was first proved in [16J, based on an 
argument from [5]. Our argument here is different and simpler. 
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It was proved in [29J that a Banach space X is p-convex if and only if it admits an equivalent 
norm || • || for which there exists K > such that for every a, b £ X, 



2\\ar + —\W^\\a + br + \\a-br. 



(10) 



Proposition 2.1. Let {Xt}teZ be random variables taking values in a set tt. For every s € Z let 

Xt(s) > be random variables taking values in Q, with the following property: 
) tez 



V r ^ s ^ t, (X r ,Xt) and \X r ,Xt(s) ) have the same distribution. 



(11) 

Fix p ^ 2 and let (X, || • ||) be a Banach space whose norm satisfies (fTU|) , Then for every f : — > X 
we have 



EE 

fc=o tez 



E 



^f{X t )-f(x t {t-2 k ) 



2kp 



tez 



(12) 



Remark 2.2. Observe that condition (jlip holds when \Xi\t&L is o, Markov chain on a state space 

£1, and I Xt(s) > is as in Definition \1.2\ . 
I J tez 



We start by proving a useful inequality that is a simple consequence of (flOj) . 
Lemma 2.3. Let X be a Banach space whose norm satisfies (|10[) . Then for every x,y,z,w £ X, 
\\x — w\\ p + llx — z\\ p \\z — w\\ p 



+ 



2p-i 4p-ij{p 
Proof. For every x,y,z,w € X, (fit))) implies that 



^ \\y — w\\ p + \\z — y\\ p + 2\\y — x\ 



(13) 



\x — w 
2 p ~ l 



\p 2 
- + 



and 



\z — x\ 
2P- 1 



+ 



K p 
2 

~KP 



y 



X + w 



Z + X 



^ 1 1 2/ - x\\ P + lb ~ HI P > 



^ ||z — y\\ p + \\y — x\\ p . 



Summing these two inequalities, and applying the convexity of the map u i— >■ \\u\\ p , we see that 



\y - w\\ p + \\z - y\\ p + 2\\y - x\\ p ^ 



\x — w\ 



2 p-i 



z - x\\ p 4 
+ 

\p 



I _ x+w IIP i II _ z+a ||P 
l y 2 II ~ r ll» 2 II 



x — w\\ p + z — x 



+ 



z — w 



2 p-i 1 x p 

implying ([13]). 

Proof of Proposition \2.1\ Using Lemma 12.31 we see that for every t 6 Z and A; € N, 

ll/TO - /(x,_ 2fc )p + - 2*- 1 )) - /CXt_gOr |[/CXt) - HMt - 2 fc - 1 ))||p 

2p-i + 4P-±K p 

< ii/(^ t -2^i) - /(^)ir + n/(*t-2*-i) - - 2 fc - 1 ))ir + 2|i/(x t _ 2 *_ 1 ) - /(x t _ 2fe )ii p . 



□ 
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Taking expectation, and using the assumption (|TTj) . we get 



E[\\f(X t )-f(X t _ 2k W] | E[\\f(X t )-f(X t (t-2^)W 



2P- 2 



^ 2E [\\f{X t _ 2 ^) - f(X t W] + 2E [\\f{X t _ 2k -y 



Dividing by 2^ k 1 ) p+2 this becomes 

E[\\f(X t )-f(X t _ 2k )\\P] , E[\\f(X t )-f(Mt-2^)W 



2 k P 



EjWfiX^-^-fjXtW] ^[\\f{X t _ 2k -,)-f(X t _ 2k )\\P] 
2(fe-i)p+i 2( fc " 1 )P+ 1 



Summing this inequality over k = 1 , . . . , m and t E Z we get 

f^ V E[ll/(^)-/(^- 2 Qr] f> V [e||/(JT«) -/&(*- 2 
Z.^ Z^ 9 fe P Z.^ Z^ 



,fc-l\\llp 



A;=l teZ 

k=i tez 

m—l 



k=l tez 



vv E[||/(X,_ 2fc - 1 )-/(X t )||P] 

Z^ Z^ o(k-Dv+i Z_> 2_> 



EE 

j=o sez 



2(fc-i)p+i 

E[n/(x,)-/(x s _ 2J )r] 

2iP 



fc=i 



2{k+i)vKP 

¥.[\\f{X t _ 2k -,)-f(X t _ 2k )\\P] 
2(fe-i)p+i 



(14) 



It is only of interest to prove (fT2|) when E^ tGZ E[||/(X t ) — /(X t _i)|| p ] < oo. By the triangle 
inequality, this implies that for every fceNwe have E^ igZ E [||/(X t ) — /(^t_2 fe )l! p ] < 00 • We may 
therefore cancel terms in (|14p . arriving at the following inequality: 



EE 

fc=i tez 



E[\\f(X t )-f(X t (t-2 k ^))\\ 



< E E til/ w - /(^-oih - E E[ll/(Xt) 2i (Xt ' 2m)r] < £ E ["/ w - /(^-oih 

tez tez tez 



tGZ 

Equivalently, 



g E[||/(I t )-f(I t (i-2 



,fc\\||p 



fc=o tez 

Proposition 12.11 now follows by letting m — > oo. 



^(4^ee[h/(^)-/(^-i)ih 

tez 



□ 



We next prove the more interesting direction of the equivalence of p-convexity and Markov 
p-convexity: a Markov p-convex Banach space is also p-convex. 
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Theorem 2.4. Let (X, || • ||) be a Banach space which is Markov p-convex with constant II. Then 
for every e G (0, 1) there exists a norm j|| • ||| on X such that for all x,y G X, 

(1 — e)||x|| ^ |||x||| ^ ||x||, 



and 



Thus the norm 



x + y 



\x\\\ p + \\\y\\\ p l-(l-e) 



4IP(p + 1) 



x-y 



satisfies (flOj) with constant K = O 



Proof. The fact that X is Markov p-convex with constant II implies that for every Markov chain 
{Xt}tez with values in X, and for every m G N, we have 



m 2™ E \\\X t - X t (t-2 k )\\ p 

EE— 

fc=0 t=i 



2 k 'P 



<np^E[||x t -x t _if] 



(15) 



t=i 



For i£lwe shall say that a Markov chain {Xt}| = _ oc is an m- admissible representation of x 
if jf t = for t < and E [X t ] = tx for t G {1, . . . , 2 m }. Fix e G (0, 1), and denote r/ = 1 - (1 - 
For every m G N define 



MIL = inf 



1 ^_ r _ r, i ^^^[\\X t - X t (t-2 k )\\ p 



-LYE[\\x t -x t - 1 \\p]-£.-—YY 

i=l fc=0 t=l 



1/p- 



2 fc P 



, (16) 



where the infimum in (|16p is taken over all m-admissible representations of x. Observe that an 
m-admissible representation of x always exists, since we can define Xt = for t ^ and Xt = tx 
for t G {1, . . . , 2 m }. This example shows that |||sc||| ^ ||x||. On the other hand, if {Xt}t=—oo 1S an 
m-admissible representation of x then 



2" 1 „ m 2™ £ [||X t -Xi(t-2 fc )|| P 

E ■ - n - £ i; E _U! — _^ — ^_ 

fc=0 i=l 



03 



t=l 



t=l 



> (1 - e) p ^ ||E [X t ] - E [X t _i] f = (1 - e) p ^ ||te - (i - l)x\\ p = 2 m (l - e)P||xf, (17) 



t=i 



where in the first inequality of (|17p we used the convexity of the function z i— >• ||-2|| p . In conclusion, 
we see that for all x G X, 



fl - e)|b|| ^ IlkllL ^ \\x\ 



(18) 



Now take x,y £ X and fix 5 G (0, 1). Let {X t }1=_ 00 be an admissible representation on x and 
{Y t }t = _ QO be an admissible representation of y which is stochastically independent of {Xt}f = _ 0O , 
such that 



m 2" 



E [||X t -X t (t-2 



fcMIP 



t=l 



fc=0 t=l 



2&p 



^2 m (|||x|L + 5), 



(19) 
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and 



m 2m e [||y t -y t (£-2 fc )| 



np 



2^p 



< 2 r 



(20) 



t=l fc=0 t=l 

Define a Markov chain {Z t YZ + -^ Q X as follows. For t < -2 m set Z t = 0. With probability § 
let (Z_2 m +i) Z— 2 m +2i • • • ■> Z 2 m+i) squal 

^0, . . . , 0, -Xl, X2, . . . , ^2 m > ^2 m + 5^1) X 2 m + 3^2) • • • > ^2 m + Y 2 mJ j 
2 m times 

and with probability | let (Z_2 m +i, ^-2 m , ■ • • j Z 2 m+i) equal 

(0, . . . , Q,Y lf Yz, ... , Y 2m ,X 1 + Y 2 m, X 2 + Y 2 m, . . . , X 2 m + y 2 " 



2 m times 



Hence, Z t = for t < 0, for t £ {1, ... , 2 m } we have E \Z t \ 
t G {2 m + 1, . . . , 2 m+1 } we have 



E[X t ]+E[y t ] 

2 



i • ^jp, and for 



E[X 2 m + Y t _ 2 m]+E[X t _ 2m +Y 2 m] 2 m x + (t - 2 m )y + (t - 2 m )x + 2 m y x + y 
^ l z d = o = o = * ' 



Thus {-Ztlt^—oo is an + Inadmissible representation of ^-5^. The definition (fT6|) implies that 



2' 



m+l 



x + y 



m+l t=l 

Note that by definition 

2?n + l 



2-+1 ™+i2" l+1 E [llZi -Z t (i-2 fc )P 

fc=0 i=l 



2 k 'P 



J2n\\Zt-Z t ^\f] = ^[\\Xt-X t ^f] + ^[\\Yt-Y t ^f] 



t=i 



t=i 



t=i 



Moreover, 



m+l2 m+l jg \^ Zt _ ^ _ 2 fc^||P 

fc=0 i=l 



2kp 



1 



2Ti + l 



m 2" 



2(m+l)p 



^E[||Z t -Z t (t-2^)f]+^^ 



1 E[||Z t -Z t (t-2 fc ) 



fc\||P 



fc=0 t=i 



2 k P 



(21) 



(22) 



• (23) 



We bound each of the terms in (|23|) separately. Note that by construction we have for every 

te{i,...,2™}, 



Z t -Z t (t- 2 m+1 ) =Z t -Z t (l- 2 m+1 ) = < 



Xt — Yt with probability 1/4, 

Yt — Xt with probability 1/4, 

X 4 - Z t (l) with probability 1/4, 

Y t - Y t (l) with probability 1/4. 
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2( 



Thus, the first term in the right hand side of (|23j) can be bounded from below as follows: 

nm-)-l 2 m 

W E e [11* - ^ - 2m+1 )ir] * E e [ii^ - w 

*=i 

2 m ||x-yp 



i=l 



2(m+l)p+l 



E* p 



2P+i(p + l) 



• (24) 



We now proceed to bound from below the second term in the right hand side of (|23p • Note first 
that for every k G {0, . . . , m} and every t G {2 m + 1, . . . , 2 m+1 } we have 



Z t -Z t {t-2 k ) = I 



(X 2 m + Y t - 2 ™) - (X 2 m(t - 2 k ) + Y t _ 2 m(t - 2 m - 2 k )j with probability 1/2, 
(Y 2 m + X t ^ 2 m) - [Y 2 m(t - 2 k ) + X t - 2 ™ (t - 2 m - 2 k ) ) with probability 1/2. 



By Jensen's inequality, if U,V are X- valued independent random variables with E[V] = 0, then 
E [\\U + V\\p] ^ E [\\U + E[V]\\ P ] = E [\\U\\ P ]. Thus, since {JQ}?™_oo and {Y t }^-oo are independent, 



E 

and 

E 



\Yt 



t-2 T: 



Yt— 2 m (t — 2 m — 2^) -\- X 2 m — X 2 m (t — 2 



k\\\P 



> E 



m r)k\\\P 



\Yt- 2 m — Yf^ 2 m(t — 2" 1 — 2 



\Xf- 2 m — Xt^ 2 m (t — 2 m — 2 k ) + Y 2 m — Y 2 m (t — 2^) \F 



> E 



\Xf_ 2 m — Xf_ 2 m (t — 2 m — 2 



It follows that for every k G {0, . . . , m} and every t G {2 m + 1, . . . , 2 m+1 } we have 



E 



\Z t -Z t (t-2 k )\\ p ^ 

\Xt— 2 m — Xf— 2 m{t — 2 m — 2 fc )|| p 



^ -E 
2 



+ -E 
2 



\Y t - 2 m -Y t ^ 2m (t-2 m -2 k )\\ p . (25) 



Hence, 



m i m+1 E^Zt - Z t (t - 2 k )\\ p 
fe=0 i=l 



> EE — 

fc=0 t=i 



\X t -X t (t-2 



k\\\P 



+ ±E 



lit - Y t (t - 2 



k\\\P 



2 k P 



E j¥^_2"> — Xt— 2 m(t — 2 m — 2 



m 2 m+1 I 

+ E E - 

fc=0 t=2 m +l 

m 2 m E [||X t -X t (i - 2 fc )|| p 
fc=0 t=i 



k\\\P 



+ ±E 



1* 



t-2 m 



Et_ 2m (t-2 m -2 



fc\||P 



2^p 

m 2m E [||k -y t (t 

+EE- 01 

fc=0 t=i 



>fc\||P 



2 k P 



(26) 



Combining (USD, {20]), (J2H) , ([22]), {23]), {Ml and {26]), and letting 5 tend to 0, we see that 



ym+l 



x + y 



^2 m \\\x\f m + 2 



m 



m+1 



t? 2 m ||x-yf 

n? ' 2P +1 (p + i) : 
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or, 



x + y 



\ x \\\m + llmil),) 



m+1 



4W(p + 1) 



x-y 



Define for w £ X, 



\w\\\ = limsup |||io| 

m— >oo 



Then a combination of (|18|) and (j'27p yields that 

(1 — e)\\x\\ ^ |||x||| ^ ||x| 

and 



x + y 



\ x W + IIMIf 



v 



4HP(p + 1) 



x-y 



\x\\\ p + [||y||| p 



4UP(p+ 1) 



Note that (j28j) implies that the set {rr € X : |||x||| ^ 1} is convex, so that 
This concludes the proof of Theorem 12.41 



(27) 



x-y 



(28) 



is a norm on X. 

□ 



3 A doubling space which is not Markov ]9-convex for any p G (0, oo^ 



Consider the Laakso graphs [12], {Gj}^ , which are Go 
defined as follows. Go is the graph consisting of one edge 
of unit length. To construct Gi, take six copies of 
and scale their metric by a factor of j. We glue four of 11 
them cyclicly by identifying pairs of endpoints, and attach 
at two opposite gluing points the remaining two copies. 
Note that each edge of Gi has length 4 _l ; we denoted the g 2 
resulting shortest path metric on Gt by dd- As shown 
in |13l Thm. 2.3], the doubling constant of metric space 
(Gi,d,Gi) is at most 6. 

We direct G m as follows. Define the root of G m to be g 3 
(an arbitrarily chosen) one of the two vertices having only 
one adjacent edge. In the figure this could be the leftmost 
vertex r. Note that in no edge the two endpoints are at the same distance from the root. The 
edges of G m are then directed from the endpoint closer to the root to the endpoint further away 
from the root. The resulting directed graph is acyclic. We now define {Xt\f2o ^° ^ e ^ ne standard 
random walk on the directed graph G m , starting from the root. This random walk is extended to 
tGZby stipulating that X t = X for t < 0, and X t = X 4 m for t > 4 m . 

Proposition 3.1. For the random walk defined above, 




^^E[d Gm {X t ,X t (t-2 k )f 

k=o tez 



2 k P 



(29) 
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Proof. For every igZwe have, 



E[d Gm (x t ,x t ^r] 



4T m P t G {0, . . . 4 m 
otherwise. 



1}, 



Hence, 



^2E[d Gm (X t ,X, 



t-ij 



-m(p—l) 



(30) 



Fix A; G {0, . . . , 2m — 2} and write /i = |~ft/2] . View G m as being built from A = G m -h, where 
each edge of A has been replaced by a copy of Gh- Note that for every % G {0, . . . , 4 m - /l_1 + l} ; at 
time t = (4z + 1)4^ the walk Xt is at a vertex of G m which has two outgoing edges, corresponding 
to distinct copies of Gh- To see this it suffices to show that all vertices of G m _h that are exactly 
(4i + 1) edges away from the root, have out-degree 2. This fact is true since G m -h is obtained from 
G m _d_i by replacing each edge by a copy of G\, and each such copy of G\ contributes one vertex 
of out-degree 2, corresponding to the vertex labeled b in the figure describing G\. 

Consider the set of times 



n d ^ f {o, 



, 4 m - 1} p |J [(4i + l)4 h + 4 h ~ 2 , (4i + l)4 h + 2-4* 



i=0 



For t G T k find t G {0, . . . , 4 m - /l ~ 1 + 1} such that t G [(4i + l)4 ft + 4 ft - 2 , (4i + l)4 h + 2 • 4 ft ~ 2 ] . 
Since, by the definition of h, we have t - 2 k G [(4i + l)4 fe - 4 h , (4i + l)4 /l ), the walks {X s } s& 
and — 2 k )} se z started evolving independently at some vertex lying in a copy of Gh preceding 

a vertex v of G m which has two outgoing edges, corresponding to distinct copies of Gh- Thus, 
with probability at least h, the walks Xt and Xf(t — 2 k ) lie on two distinct copies of Gh in G m , 
immediately following the vertex v, and at distance at least 4 fe_2 -4 _m and at most 2-A h ~ 2 -4~ m from 



v. Hence, with probability at least ^ 
and therefore, 



we have d Gm [X t ,X t {t-2 



E 



d(X t ,X t (t-2 k )Y 



2&p 

We deduce that for all ft G {0, ... , 2m — 2}, 



_2(2/i-3-2m)p 



2 k P 



> 2 



-(2m+3)p-l 



)2h-3-2m 



E 



[d(X t ,X t (t - 2 k ))p\ E [d(X t ,X t (t - 2 k )f 



2 k P 



ten 



2 k P 



> \T k \-2 



-(2m+3)p-l 



A combination of (f30|) and (f31~|) implies ([29 



> 4^-2 , ^m—h—l _ q~ (2m+3)p— 1 > -—4~ m (P~ 1) fQI") 



□ 



Proof of Theorem \1.5l As explained in [T2"l fl~3] , by passing to an appropriate Gromov-Hausdorff 
limit, there exists a doubling metric space (X,dx) that contains an isometric copy of all the 
Laakso graphs {G m }^ =0 . Proposition 13. II therefore implies that X is not Markov p-convex for any 

pe(0,oo). □ 
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Proof of Theorem \l.b\ Let (X, dx) be a Markov p-convex metric space, i.e, U P (X) < oo. Assume 
that / : G m — > X satisfies 

x,y G G m id Gm (x,y) < d x (f(x),f(y)) < Bd Gm (x,y). (32) 

Let {Xt}t£i be the random walk from Proposition 13.11 Then 



ea i ^ i[d Gm (x t ,x t (i-2 A )) 3 
^-^2^ E t d G m (^t,^-i) p ] < ^z^z^ ^ 



m ^ v[dx{f(X t )J(X t (t - 2 k ))) p 



fc=o te 

f n p (x)^E[d x (/(x,),/(^_ 1 )n 

Thus AS > m l / p > (log |G TO |) 1/p . □ 

4 Lipschitz quotients 

Say that a metric space (Y,dy) is a D-Lipschitz quotient of a metric space {X, dx) if there exist 
a, b > with ab ^ D and a mapping / : X — > Y such that for all x G X and r > 0, 

By(/(43c/(fl x ( S) r))CBy(/(x),6r). (33) 

Observe that the last inclusion in (|33p is to equivalent to the fact that / is 6-Lipschitz. 
The following proposition implies Theorem 11.41 

Proposition 4.1. If (Y, Dy) is a D-Lipschitz quotient of (X, d x ) then H p (Y) ^ D • U p (X). 

Proof. Fix / : X — > Y satisfying (j33|) . Also, fix a Markov chain {Xt}tez on a state space f2, and a 
mapping g : Q, —> Y. 

Fix m G Z and let 0* be the set of finite sequences of elements of f2 starting at time m, i.e., 
the set of sequences of the form G f2*~ m+1 for all t ^ m. It will be convenient to consider 

the Markov chain {X^}^ m on 0* which is given by: 

P r [^Q* = {^m, W m +li • • • i w t)] = P r [A m = W m , A m+ i = W m +l, . . . ,X t = tO t ] . 



oo 

m 



Also, define 5 * : W -> Y by g*(ui, ...,<*) = g{<Jt). By definition, {g*(X*)}^ m and { 5 (A t )} t c 
are identically distributed. 

We next define a mapping h* : Q* — > X such that / o h* = g* and for all (u m , . . . , wj) G f2*, 

c?x (h*(u} m , . . . ,u t -i),h*(u m , ■ ■ -,ut)) < ad Y (g(ut-i),g(ut))- (34) 



17 



For uj* G O*, we will define h*{uj*) by induction on the length of uj* . If uj* = (uj m ), then we fix 
h*{ui*) to be an arbitrary element in f~ 1 (g(uj rn )). Assume that uj* = {u) m , ■ ■ ■ ,Ut-i,uJt) and that 
h*(oj m , . . . ,L)t-l) has been defined. Set x = f(h*(u) m , . . . ,uj t -i)) = g*(uj m , ■ ■ ■ ,U3t-l) = ff(^t-l) and 
r = ad,Y(g(uJt-i),g{wt))- Since g(ujt) G By (x,r/a), it follows from (j33|) there exists y G X such 



that f(y) = g(uj t ), and d x (x,y) < r. We then define h*((uj m , . . . ,u t -i,ujt)) = f V 
Write X£ = X^ for t ^ m. By the Markov p-convexity of (X, dx), we have 

E \dx{h*{Xt), h*(Xt(t — 2 k ))) p 



EE 



k=0 



2kp 



^ u p (xrJ2 E [dx{h*(x;),h*(xu)f 



(35) 



By (|34p we have for every t ^ m + 1, 

dx(/i*TO,/i*(-X?_i)) ^ ody(sM,<7(X t _i)), 
while for t < m we have d x (h*(X* ), /^(X^)) = 0. Thus, 

^E[dx(/ i *(X t *),/ l *(x;_ 1 )) P ] ^a^E[^( 5 (X0,5(^-i)) p ]- 



(36) 



At the same time, using the fact that / is 6-Lipschitz and / o h* = g*, we see that if t ^ m + 2 k . 

d x [h*{Xt)X{Xt{t-2 k ))) > l l d Y (f{h*(X* t ))J(h*(X* t {t-2 k )))) 

= \d Y {g*{X* t ),g*(X* t {t - 2 k ))) = l -d Y {g(X t ),g^X t (t - 2 k ))) 



Thus, 



EE 

k=o tez 



E 



[dx{h*(X*),h*(X;(t-2 k ))Y 



2kp 



bp 



oo oo g 

E E - 



d Y (g{X,), 9 (X t {t-*))Y 



2kp 



■ (37) 



k=0t=m+2 k 

By combining ([36]) and ([37]) with ([35]) . and letting m tend to — oo, we get the inequality: 

E[d Y ( g (X t ), g (X,(t-2 l '))) p 



EE 

k=0 tez 



2kp 



^(abu p (x)) p J2nM9(x t ),g(x t ^)y}. 



Since this inequality holds for every Markov chain {Xt}t^z and every g : O — >• Y, and since ab sj D, 
we have proved that IL p (Y) ^ DH p (X), as required. □ 



5 A dichotomy theorem for vertically faithful embeddings of trees 

In this section we prove Theorem 11.141 The proof naturally breaks into two parts. The first is the 
following BD Ramsey property of paths (which can be found non-quantitatively in [19j . where also 
the BD Ramsey terminology is explained). 

A mapping <j) : — > JV is called a rescaled isometry if dist(^) = 1, or equivalently there exists 
A > such that d ^(cj)(x), <f>{y)) = \djf(x, y) for all x, y G M '. For n G N let P n denote the n-path, 
i.e., the set {0, . . . ,n} equipped with the metric inherited from the real line. 
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Proposition 5.1. Fix 5 G (0, 1), D > 2 and t,n G N satisfying n ^ £)(4tiogt)/<5_ if f : p n X 
satisfies dist(/) ^ D i/jen i/iere exists a rescaled isometry <p : Pt — >■ P„ swc/i i/iai dist(fo^) ^ l + <5. 

Given a metric space (X, dx) and a nonconstant mapping / : P n — >■ X, define 

r(X)/) dcf rfx(/(0),/(n) = dx(/(0),/(n)) | 

nmax ie { 1) ... jn }dx(/(i- 1), /(«')) re ll/llLi P 

If / is a constant mapping (equivalently majq^n n} dx(f(i—l), /(*)) = 0) then we set /) = 0. 
Note that by the triangle inequality we always have T(X, /) ^ 1. 

Lemma 5.2. For every m, n G N and / : P mn — > X , there exist rescaled isometries (j)^ : P n —> P mn 
and ^ : P m 

— ^ Pmn j such that 

T(X, f) < T (x, f o -T(x,fo . 

Proo/. Fix / : P mn -> X and define <^ m ) : P m -> P mn by c/>( m )(i) = in. Then, 

dx(/(0),/(mn)) ^T(l,/o^H) m max _ l) n ),/(m)). (38) 

V / ie{l,...,m} 

Similarly, for every i G {1, . . . , m} define <^ n ^ : P n — > P mn by <fij"^(J) = (i — l)n + j. Then 
dx(/((*-l)n),/(*n)) ^r(x,/o^) n max d x (f((i-l)n + j-l),f((i-l)n + j)). (39) 

v 7 jG{l,...,n} 

Letting i G {1, . . . , m} be such that T (^X, f o <^ n ^ is maximal, and (p^ = <^ n \ we conclude that 
4(/(0), / H) ? T (l, / o ^W) T fl, / o (jW) mn max d x (f(i - 1), /(*)). □ 

V /V / ie{l,...,mn} 

Lemma 5.3. For every / : P m X we have dist(/) ^ l/T(X,f). 

Proof. Assuming a\i — j\ ^ d x (f(i),f(j)) ^ &|i — il f° r a ^ *>J e ^m, the claim is bT(X,f) ^ a. 
Indeed, am < d x (f(0),f(m)) ^ T{X, /)mmax i=6{1> .„ )m} d x (/(i - 1), /(»)) < T(X,f)bm. □ 

Lemma 5.4. Fix f : P m ^ X . If < e < 1/m and T(X,f) >l-e, then dist(/) < 1/(1 - me). 
Proof. Denote 6 = max i6 r 1) n \ d x (f(i), f(i — 1)) > 0. For every ^ i < j ^ m we have 
d x (f(i)J(j)) ^ YLi+idxW-l),fW) < 6|i-i|, and 

(1 - e)m6 < T(X, f)mb = d x (/(0), /(m)) 

< dx(/(0), /(i)) + d x (/(i) ,/(J)) + dx(/(j), /M) < dx(/(i), f{j)) + b(m + i - j). 
Thus d x (f(i),f(j)) > b{j - i - me) > (1 - ms)b\j - i\. □ 

Proof of Proposition HOI Set k = [log t n\ and denote by I the identity mapping from P t k to P n . 
By Lemma 15.31 we have T(X,f o I) 1/D, An iterative application of Lemma 15.21 implies that 
there exists a rescaled isometry <j> : Pt — > P t fc such that 

c 

T{XJ olocj)) > D- l ' k > e -21ogD/log t n ^ e -5/(2t) ^ j _ 

By Lemma 15.41 we therefore have dist(/ o I o 0) ^ 1/(1 — (5/2) ^1 + 5. □ 
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The second part of the proof of Theorem 11.141 uses the following combinatorial lemma due to 
Matousek [18]. Denote by Tfc >m the complete rooted tree of height m, in which every non-leaf vertex 
has k children. For a rooted tree T, denote by SP(T) the set of all unordered pairs {x, y} of distinct 
vertices of T such that x is an ancestor of y. 

Lemma 5.5 ( |18l Lem. 5]). Let m,r,k G N satisfy k ^ r (m+i) 2 _ Suppose that each of the pairs 
from SP(Tfc m ) is colored by one of r colors. Then there exists a copy T' of B m in this T& m such 
that the color of any pair {x,y} G SP(T') only depends on the levels of x and y. 

Proof of Lemma \1.14\ Let / : B n -flbea D- vertically faithful embedding, i.e., for some A > 
it satisfies 

\d Bn (x,y) < d x (f(x),f(y)) ^ DXd Bn (x,y) (40) 

whenever x, y G B n are such that x is an ancestor of y. 

Let k,£ G N be auxiliary parameters to be determined later, and define m = [n/(k£)\. We 
first construct a mapping g : T 2 k m — > B n in a top-down manner as follows. If r is the root of 
T 2 k m then g(r) is defined to be the root of B n . Having defined g(u), let v%,..., v 2 k G T 2 k m be the 
children of u, and let wi, ■ ■ ■ , w 2 k G B n be the descendants of g(u) at depth k below g(u). For each 
i € {1, . . . , 2 k } let g(vi) be an arbitrary descendant of W{ at depth h(g(u)) + Ik. Note that for this 
construction to be possible we need to have m£k ^ n, which is ensured by our choice of m. 

By construction, if x, y G T 2 k m and x is an ancestor of y, then g(x) is an ancestor of g(y) 
and dB„(g(x), g(y)) = £kdx 2k (x,y). Also, if x,y G T 2 k^ m and \ca(x,y) = u, then we have 
h(\ca(g(x),g(y))) G {h(g(u)), h(g{u)) + 1, . . . , h(g{u)) + k - 1}. This implies that 

((£-l)k + l)d T2k Jx,y)^d Bn (x,y)^ekdT 2k Jx,y). 

Thus, assuming I ^ 2, we have dist(<?) ^ 1 + 2/£. Moreover, denoting F = f o g and using (j4*0|) . we 
see that if x, y G T 2 k m are such that x is an ancestor of y then 

WAdr^ Ji,!/) < dx^Cx),^!/)) < D£kXd T2k Jx,y). (41) 

Color every pair {x,y} G SP(T 2 fe ) with the color 

G 11 rl 

■ y KiAUT 2k m {x,y) j 
where r = [log 1+ ^ 4 -D] . Assuming that 



x({x,y}) = f 



/ dx(F(x),F(?/)) \ 
bg ^ [k£Xd T2k Jx,y) 



2 fe ^r (m+1)2 , (42) 

by Lemma [531 there exists a copy T' of 5 m in T 2 k m such that the colors of pairs {x, y} G SP(T') 
only depend on the levels of x and y. 

Let P be a root-leaf path in T 1 (isometric to P m )- The mapping F\p : P — > A" has distortion 
at most .D by (|4"T|) . Assuming 

m ^ D 16 (« g*)/ 5 , (43) 

by Proposition 15.11 there are {xj}- =0 C P such that for some a, 6 G N with a,a + tb £ [0, m], for all 
i we have Ti(iCi) = a + ib, and for some # > 0, for all i,j G {0, . . . , t}, 

Obli-jl^dxiFixilFixj))^ (l + ^\eb\i-j\. (44) 



20 



Define a rescaled isometry ip : Bt — > T' in a top-down manner as follows: f(r) = Xo, and 
having defined (p(u) € T', if v,w are the children of u in £?t and v',w' are the children of <p(u) 
in T", the vertices ip(v),ip(w) are chosen as arbitrary descendants in T' of i/,u/ (respectively) at 
depth h{ip(u)) + b. Consider the mapping G : B t — > X given hyG = Foip = fogo{p. Take 
x,y £ Bt such that x is an ancestor of y. Write = z and h{y) = j. Thus h((p(x)) = a + ib and 
h(ip(y)) = a + jb. It follows that {f(x), (p(y)} is colored by the same color as {xi,xj}, i.e., 



log! 



f d x (G(x),G(y)) \ 



' 1+ 4 y k£XbdB t (x,y) J 
Consequently, using (jH| we deduce that 



x(MaO,¥>(y)}) = x({xi,Xj}) 



log!. 



4 / <i x (F(x,),F(^)) 
"4 y k£XbdB t (x, y) 



2 



:d Bt (x,y)^d x (G(x),G(y)) < ( 1 + 7 ) Obd Bt {x,y). 



1 + 5/4 " " v v " V 4 

Thus G is a (1 + 5/4) 3 ^ 1 + 5 vertically faithful embedding of B t into X. 

It remains to determine the values of the auxiliary parameters £, k, which will lead to the desired 
restriction on n given in ([9]). First of all, we want to have dist(g o cp) ^ 1 + £• Since 99 is a rescaled 
isometry and (for £ ^ 2) dist(g) ^ 1 + 2/£, we choose £ = |~2/£] ^ 2. We will choose k so that 
4k ^ n£, so that nj(k£) ^ 1. Since m = [n/(k£)\, we have m + 1 ^n^/k and m ^ n^/{4k). Recall 
that r = [log 1+(5 / 4 D] ^ 21og 1+5 / 4 D ^ IQD/5. Hence the requirement (|4"2|) will be satisfied if 



2 fc3 > \=f-J ■ (45) 

and the requirement (ji"3|) will be satisfied if 

< ^ ^leCtlogt)/^ ^ 46 \ 
4k ^ 

There exists an integer k satisfying both (|4"5|) and (|46p provided that 

/16L>\\ 1/3 nf 



16DV 



f 2,2, A6£>\\ 1 < 

{ n t lo ^{—)) +ls ^ 4D mio S t)/s - 



which holds true provided the constant c in ([9]) is large enough. □ 



6 Tree metrics do not have the dichotomy property 

This section is devoted to the proofs of Theorem 11.101 and Theorem [LT2J These proofs were outlined 
in Section [1,1. H and we will use the notation introduced there. 

6.1 Horizontally contracted trees 

We start with the following lemma which supplies conditions on {e n }^ =0 ensuring that the H-tree 
(i?oo,d e ) is a metric space. 

Lemma 6.1. Assume that {e n } ( £ = Q C (0, 1] is non-increasing and {ne n } c £' = Q is non- decreasing. 
Then d £ is a metric on B^ 
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Proof. Take x,y,z S and without loss of generality assume that h(x) ^ h(y). We distinguish 
between the cases h(z) > h(y), h(x) ^ h(z) ^ h(y) and /i(z) < h{x). 
If > h(y) then 

d E (x,z) + d £ (z,y) - d £ (x,y) 

= 2[h(z) - h(y)] + 2e h (a.) • [/i(lca(x, y)) - h(\ca(x, z))] + 2e h{y) ■ [h(y) - h(\ca(z, y))] 

^ 2e h(x) ■ [h(\ca(x, y)) - /i(lca(x, z))] + 2e h(y) ■ [h(y) - h(\ca(z, y))] . (47) 

To show that (|37|) is non-negative observe that this is obvious if h(\ca(x,y)) ^ h(\ca(x, z)). So 
assume that h(\ca(x,y)) < h(\ca(x, z)). In this case necessarily h(\ca(z, y)) = h(\ca(x,y)), so we 
can bound (jJTJ) from below as follows 

^h(x) ■ [h(\ca(x, y)) - h(\ca(x, z))] + 2e h{ , y) ■ [h(y) - h(\ca(z, y))\ 

h(x) 

> 2 ^h{x) ■ [h(\ca(x, y)) - h(x)\ + 2—--e h ^ x) ■ [h(y) - h(\ca(x, y))] 



= 2s Hx yh(\ca(x,y))(l-^j^0. 
If h(z) < h{x) then 



d e (x, z) + d £ {z, y) = h(x) + h(y) - 2h(z) + 2e h(z) ■ [2h(z] - h(\ca(x, z)) - h(\ca(z, y))] 

> h(y) - h(x) + 2e h{x) ■ [h(x) - h(z)\ + 2e h(x) ■ [2h(z) - h(\ca(x, z)) - h(\ca(y, z))] 
= h(y) - h(x) + 2e h{x) ■ [h(x) + h(z) - h(\ca(x, z)) - h(\ca(y, z))} 

> h{y)-h{x)+2e hix y[h(x)-h(\ca{x,y))} (48) 
= d £ (x,y). 

Where in (I48p we used the fact that h{z) ^ h(\ca(x, z)) + h(\ca(y, z)) — h(\ca(x,y)), which is true 
since h(\ca(x, y)) ^ min {h(\ca(x, z)), h(\ca(y, z))}. 

It remains to deal with the case h(x) ^ h(z) ^ h(y). In this case 



d £ (x, z) + d £ (z, y) = h(y) - h(x) + 2e h(x) ■ [h(x) - h(\ca(x, z))} + 2e h{z) ■ \h(z) - h(\ca(y, z))\ 

h(x) 

^ h{y) - h(x) + 2e h(x) ■ \h(x) - h(\ca{x, z))\ + 2 jj^ e Kx) ■ Hz) - h(\ca(y, z))\ 



= h(y) - h(x) + 2e h ( x) ■ 2h(x) - h(\ca(x, z)) 

> h(y) - h(x) + 2e h{x) ■ [h(x) - h(\ca(x, y))] 
= d £ (x,y), 



h(x 



h(x) 



h(\ca(y,z)) 



(49) 



where (1491) is equivalent to the inequality 



h(x) ^ h(\ca(x, z)) + — — -h(\ca(y, z)) - h(\ca(x, y)). 
ti{z) 



(50) 



To prove ([50]) . note that it is true if h(\ca(x, v)) ^ h(\ca(x, z)), since clearly h(\ca(y, z)) ^ h(z). If, on 
the other hand, h(\ca(x,y)) < h(\ca(x, z)) then using the assumption that h(z) ^ h(x) it is enough 
to show that h(x) ^ h(\ca(x, z)) + h(\ca(y, z)) — h(\ca(x, y)). Necessarily h(\ca(x, y)) = h(\ca(y, z)), 
so that the required inequality follows from the fact that h(x) ^ h(\ca(x, z)). □ 
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6.2 Geometry of //-trees 



6.2.1 Classification of approximate midpoints 

From now on we will always assume that e = {e n }'^' = Q satisfies for all n G N, e n ^ £ n +i > 
and (n + l)e n +i ^ ne n . We recall the important concept of approximate midpoints which is used 
frequently in nonlinear functional analysis (see [lj and the references therein). 

Definition 6.2 (Approximate midpoints). Let (X,d x ) be a metric space and 5 G (0,1). For 
x,y G X the set of 5 -approximate midpoints of x and z is defined as 

Mid(x,z,5) = jy G X : max{d x (x, y), d x (y, z)} ^ —y- ' dx(x,z) j • 

From now on, whenever we refer to the set Mid(x, z, 5), the underlying metric will always be 
understood to be d £ . In what follows, given r\ > we shall say that two sequences (u±, . . . , u n ) and 
(«!,..., v n ) of vertices in B^ are n-ne&r if for every j G {1, . . . , n} we have d £ (uj,Vj) ^ 77. We shall 
also require the following terminology: 

Definition 6.3. An ordered triple (x, y, z) of vertices in B^ will be called a path-type configuration 
if h(z) ^ h(y) ^ h{x), x is a descendant of y, and h(\ca(z,y)) < h(y). The triple (x,y,z) will be 
called a tent-type configuration if h(y) si h(z), y is a descendant of x, and h(\ca(x, z)) < h{x). 
These special configurations are described in Figured 





Path-type 



Tent-type 



Figure 3: A schematic description of path-type and tent-type configurations. 

The following useful theorem will be used extensively in the ensuing arguments. Its proof will 
be broken down into several elementary lemmas. 

Theorem 6.4. Assume that 5 G (0, and the sequence e = {e n }^L satisfies e n < \ for all 
n G N. Let x,y,z G (-Boo,d £ ) be such that y G Mid(x,z, S). Then either (x,y,z) or (z,y,x) is 
35d £ (x, z)-near a path-type or tent-type configuration. 

In what follows, given a vertex v G i?oo we denote the subtree rooted at v by T v . 

Lemma 6.5. Assume that e n ^ \ for all n. Fix a G B^ and let u,v G -Boo be its children. For 
every x,z G T u such that h(x) ^ h(z) consider the function D x ^ z : {a} U T v — > [0, 00) defined by 
D x ,z(y) = d £ (x,y) + d £ (z,y). Fix an arbitrary vertex w G T v such that h(w) = h{z). Then for every 
y GT„ we have D XjZ (y) ^ D x>z (w). 
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Proof. By the definition of d £ we have D x ^ z {y) = Q(h(y)) where 

Q(k) = max{/i(x), k} + max{fc, h(z)} — mm{h(x), k} — min{/c, h(z)} 

+ 2£mm{h(x),k} [min{/i(x), k} - h(a)} + 2e min{kMz)} {mm{k, h(z)} - h(a)] . 

The required result will follow if we show that Q is non-increasing on {h(a), h{a) + 1, . . . , h(z)} and 
non-decreasing on {h(z), h(z) + 1, . . .}. If k G {h(a), h(a) + 1, . . . , h(z) — 1} then 

Q(k + 1) - Q(k) = -2 + As k+l [k + 1 - h(a)] - 4e k [k - h(a)] 

-2 + Ae k [k + 1 - h(a)} - Ae k [k - h(a)] = -2(1 - 2e k ) 0. 

If k G {h(x), h(x) + 1, . . .} then Q(k + 1) - Q(k) = 2, and if k G {h(z), h(x) - 1} then 

Q{k + 1) - Q{k) = 2 \{k + l)e fc+ i - te fc ] + 2/i(a)[e fc - e k+1 ) ^ 0. 

This completes the proof of Lemma 16.51 □ 

Lemma 6.6. Assume that e n < \ for all n G N. Fix 5 G (0, ^) and x,y,z G -Boo smc/i £/iat 
/i(x) ^ /i(^), y G Mid(x, z,5) and h(\ca(x, z)) > /i(lca(x, y)). Then 

Hz) + l -^d £ {x, z) < %) < h(x) < %) + - %)]• 

2 1 — So 

Moreover, if y' G -Boo is the point on the segment joining x and \ca(x,y) such that h{y') = h{y) then 
de{ViV') ^ 5d £ (x,z). Thus (x,y',z) is a path-type configuration which is 5d £ (x, z)-near (x,y,z) 

Proof. Write a = lca(x, y). If u, v are the two children of a, then without loss of generality x,z G T u 
and y G T v . Let w G T v be such that h(w) = h{z). 

By Lemma 16.51 « 

d £ (x,y) + d £ (z,y) ^ d £ (x,w) + d £ (z,w) / \ 

= /i(x) - + Ae h{z) [h(z) - h(a)} ^ 2d £ {x,z) - [h(x) - h{z)]. (51) A \ 

/ \ w 

On the other hand, since y G Mid(x, z, 5), we have that / z \ 

2^- • y 

d E (x,y) + d E (z,y) ^ (l + (J)4(x,2). / 

.T • 

Additionally, by the definition of d £ we know that if h{y) ^ h(z) then 

^-^d e (x, z) ^ y) ^ h(x) - h(y) ^ /i(x) - h(z). 
Combining these observations with (|5T|) we get that 

(l + 5)d £ (x,z) > 2d £ (x,z) - l±-de(x,z), (52) 
which is a contradiction since S < |. Therefore /i(y) > /i(z). If /i(y) ^ /i(x) then 

i^— 4(x, z) ^ 4(^, y) ^ /i(y) - h(z) ^ /i(x) - h(z), 
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so that we arrive at a contradiction as in (|52p . We have thus shown that h(z) < h(y) < h{x). 
Now, since y G Mid(x, z, 5), 

h(x) - h(z) + 2e h{y) [h(y) - h(a)) + 2e h ^ z) [h(z) - h(a)) = d e (x, y) + d £ (z, y) 

< (1 + 6)d £ {x, z) = (l + 5) (h(x) - h(z) + 2e h{z) [h{z) - h{a)\) . 

Thus, letting y' be the point on the segment joining x and a such that h(y') = h(y), we see that 

d £ (y,y') = 2e h{y) [h(y) - h(a)\ < 6 (h(x) - h(z) + 2e h{z) [h{z) - h(a)\) = 5d £ (x,z), 

Moreover 

^——d £ (x, z) ^ d £ (y,z) = h(y) - h(z) + 2e h(z) h(z) - 2e hlyZ) h{a) 

^ h(y) - h(z) + 2s h ( y )h(y) - 2e h ^h(a) ^ h(y) - h(z) + 5d e (x,z). 



Thus 



Hence, 



1 35 

h{y) - h(z) ^ -^—d £ {x, z). (53) 



2 <G3 

- r h(y) - h{z)\ > d £ (x, z) = [h(x) - h(y)} + [h(y) - h(z)}. 



1-35 
It follows that 

h(x) - h(y) ^ ^±^[h(y) - h(z)}. 

This completes the proof of Lemma 16,61 □ 

Lemma 6.7. Assume that e n < \ for all n G N. Fix 5 G (0, j^) and assume that x,y,z G 
are distinct vertices such that lca(x,y) = lca(x,2:) ; and y G Mid(x,z,5). Then either (x,y,z) or 
(z,y,x) is 35d £ (x, z)-near a path-type or tent-type configuration. 

Proof. Denote a = \ca(x,y). Our assumption implies that h(\ca(z, y)) ^ h{a). We perform a case 
analysis on the relative heights of x,y, z. Assume first that h{x) ^ h(y). 

If h{x) ^ h{y) ^ h{z) then - a 

(1 + 5)d £ (x,z) ^ d £ (x,y) + d £ (y,z) / \ 

= h{y)- h(x) + 2e h(x) [h(x) - h(a)] / \ ^ Z) 

+ h(z) - h(y) + 2e h{y) [h{y) - h(\ca{z,y))} J- i\ 

= d £ {x,z) + 2e h(y) [h(y) - h(\ca(z,y))}. (54) y'l--\y 

Let y' be the point on the path from lca(y,z) to z such that h(y') = h(y). I 
Then (f54"j) implies that 

d s (y,y') = 2e h ( y) [h(y) - h(\ca(z,y))} ^ 5d £ (x,z). 

Thus the triple (z,y',x) is a configuration of path-type which is 5d £ (x, z)-near (z,y,x). 
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If h(x) ^ h(z) ^ h(y) then since 

^—d e (x, z) ^ d £ (x, y) = h(y) - h(x) + 2e h{x) [h(x) - h(a)\ 
and d £ (x, z) = h(z) — h{x) + 2s h ^[h(x) — h(a)] we deduce that 



1-5 



d £ (x, z) ^ d £ (x, y) - d £ (x, z) = h(y) - h(z) ^ 0. 



It follows that x = z, in contradiction to our assumption. 

If h(z) < h(x) then let z' be the point on the segment joining a and y such 
that h(z') = h{z). We thus have that 

d £ (z,z') = 2e h(z) [h(z) - h(\ca(y,z))] = d £ (z,y) - [h(y) - h{z)\. 

Moreover, 

2e h{z) [h(\ca(z,y)) - h(o}] = d £ (x,z) - [h(x) - h(z)} - 2e h(z) [h(z) - h(\ca(z,y))] 

2 

:d £ (z, y) - [h(x) - h(z)\ - 2e h(z) [h(z) - h(\ca(z, y))} 



(h(y) - h(z) + 2e h{z) [h(z) - h(\ca(y, z))}) 



1 + 5 
2 

T+s 

- (%) - h(z) + 2e h(z) [h(z) - h(\ca(y, z))}) + [h(y) - h(x)] 
^-^d £ (y,z) + [h(y)-h(x)} 
1-5 1-5 



1 + 5 



1-5 



d £ (x,z) + [h(y) - h(x)] 



d £ (x,y) + [h(y) - h(x)] 
46 



1 + 5 

d £ (x,y) + [h(y) - h{x)\ 



(1 + S) 2^y) 



2[h(y)-h(x))+2e h{x) [h(x)-h(a)) 



Td £ (x,y) 



Thus 



25 
T+5 



lca(j/, z) 



\ca{y,z) 




(1+6) 2 

25 

> 2[%) - h(x)\ + 2e h ( z) h(z) - 2e h{z) h(a) - ^-^d s {x, z). 



d £ (x, z) > 2[h{y) - h(x)] + 2e h(z) [h{z) - h{\ca{y, z))} = 2[h(y) - h(x)] + d £ (z, z'). (55) 



Let y' be the point on the path from o to y such that h{y') = h(x). It follows from (|55p that the 
triple (z',y',x) is a configuration of tent-type which is 25d £ (x, z)-near (z,y,x). 

This completes the proof of Lemma 16,71 when h(x) ^ h(y). The case h{x) > h{y) is proved 
analogously. Here are the details. 
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Assume first of all that h(z) ^ h(x) > h(y). Then, 

2 2(5 
d £ (x,z) ^ Y~-^d £ {z,y) = 2d £ (z,y) - Y^d £ {z,y) / \\a(y,z) 

2(5 

= 2 (h(z) - h(y) + 2e h{y) [h(y) - h{\ca(z,y))}) - ^-^d £ (z,y) 

^ 2 (h(z) - h(y) + 2e h{y) [h(y) - h(\ca(z,y))}) - 5d £ (x, z). (56) 

On the other hand, since h{x) > h(y), 

d £ (x,z) = h(z) - h(x) + 2e h ( x) [h(x) - h(a)] 

< h(z)-h(x) + 2e h(y) [h(x)-h(a)} 

= (h(x) - h(y) + 2e h{y) [h(y) - h(a)}) + h(y) + h(z) - 2h(x) + 2e h{y) [h(x) - h{y)\ 

= d £ {x, y) + h{y) + h{z) - 2h{x) + 2e h(y) [h(x) - h{y)\ 

^ j^Ts d e(y, *) + Kv) + K*) ~ 2 K X ) + 2£ h(y) [h(x) - h(y)] 

2(5 

= (h(z) - h(y) + 2e h{y) [h(y) - h(\ca(z, y))]) + j^4(y, z) 
+h{y) + h{z) - 2h(x) + 2e h{y) [h{x) - h(y)] 
2[h(z) - h{x)} + 2e m [h{x) - h(\ca(z,y))} + ^5d £ {x, z). 



Combining this bound with (|56jh and canceling terms, gives 
28 

— -d E (x,z) ^ 2[h(x) -h(y)] - 4e h(y) [h(x) - h(y)] + 2e h{y) [h(x) - h(\ca(z, y))] 

> 2 (1 - 2e m ) [h{x) - h{y)} + 2e h{y) [h{x) - h(\ca(z, y))] 

> [h(x)-h(y)] + 2e h{y) [h(x)-h(\ca(z,y))}, (57) 

where we used the fact that £h(y) < f ■ Let x' be the point on the path from a; to a such that 
h(x') = h(y), and let y' be the point on the path from a to z such that h(y') = h(y). Then by (f57|) 

d £ (x,x') = h(x) — h(y) ^ 35d £ (x,z) and 

d £ (y,y') = 2e h(y) [h{y) - h(\ca(z,y))] ^ 2e h{y) [h(x) - h(\ca(z,y))] ^ 35d £ (x,z). 

Thus the triple (z, y' , x') is a configuration of path-type which is 35d £ (x, z)-near (z, y, x). 
If h(x) > h{z) > h{y) then J 

/i(z) - %) + 2e /i(j/) [%) - /i(lca(z, y))] = 4(^,2/) ^ —-^4(^,2/) / Vlca(j/,z) 

2(5 2(5 / / \ 

= 4(^,y)-Y^ d e( x 'y) = H x )-Ky)+ 2£ h( y )[h(y)- h ( a )}-Y^r§ d ^ x ^y^ L---L\ y 

Canceling terms we see that L j z 

25 



1 + 5 



d £ (x,y) ^ h{x) - h{z) + 2e h{y) [h(\ca(z,y)) - h(a)] 

= h(x) - h(z) + 2e h{y) [h(z) - h(a)] - 2e h{y) [h(z) - h(\ca(z,y))] 
^ h(x) - h(z) + 2e h ( z) [h(z) - h(o)} - 2£ h{y) [h(z) - h(\ca(z,y))] 
= d £ (x,z) - 2£ h{y) [h(z) - h(\ca(z,y))) 
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25 

> 2d £ (z,y) - Y^d £ (x,z) - 2e h{y) [h(z) - h(\ca(z,y))] 

2(5 

= 2 (h(z) - h{y) + 2e h(y) [h(y) - h(\ca{z, y))}) - jXjdefo z ) 
- ^h{ y )[K z ) ~ h(\ca(z,y))] 

2,6 

= 2 (1 - 2e m ) Hz) - h{y)\ + 2e h{y) [h(z) - h(\ca(z, y))} - z) 

2(5 

> i h ( z ) ~h(y)] + 2 ^h(y)[H z ) - h(\ca(z,y))] - ——^defaz) 

25 

= d £ (z,y) - Y^-§d £ (x,z) 

which is a contradiction since 5 < 

The only remaining case is when h(x) > h(y) > h(z). In this case we 
proceed as follows. 

d £ (x, z) = h(x) — h(z) + 2Eh( z ) \h{z) — h(a)] 

= d £ (y,z) + [h(x) - h(y)\ + 2e h(z) [h(\ca(y, z)) - h(o)] 
2(5 

^ d £ (x,y) - z^-^d e {x,y) + W x ) ~ h (v)] + 2£ h(z)[h(^a(y,z)) - h(a)] 
^ h(x) - h(y) + 2e hiy) [h(y) - h(a)] - 5d £ (x,z) 

+ [h{x] - h{y)} + 2e h(z) [h(\ca(y, z)) - h{a)} 
> 2[h(x) - h{y)} + 2e h(z) [h(z) - h(a)] + 2e h(z) [h{\ca(y, z)) - h(a)} - 5d £ (x,z) 
= 2[h(x) - h(y)] + 4e h ( z )[h(z) - h(a)} - 2e h{z) [h(z) - h{\ca(y,z))} - 5d £ (x,z) 
= 2d £ (x,z) - 2d £ (z,y) + 2e h(z) [h(z) - h(\ca(y,z))} - 5d £ (x,z) 
^ (l-25)d £ (x,z) + 2e h(z) [h(z)-h(\ca(y,z))]. (58) 

Let z 1 be the point on the path from a to y such that h(z') = h(z). Then 

(55J) 

d £ (z,z') = 2e h ( z) [h(z) - h(\ca(y,z))} < 25d £ (x,z). 

Therefore the triple (z', y, x) is of tent-type and is 25d £ (x, z)-near (z, y, x). The proof of Lemma [6. 71 
is complete. □ 




Proof of Theorem \6.4\ It remains to check that for every x,y,z 6 such that y € Mid(x, z, 5), 
at least one of the triples (x,y,z) or (z,y,x) satisfies the conditions of Lemma 16.61 or Lemma 16.71 
Indeed, if h(\ca(x,y)) = h(\ca(x,z)) then \ca(x,y) = \ca(x,z), so Lemma 16.71 applies. If 
h(\ca(x,y)) < h(\ca(x, z)) then \ca(z,y) = \ca(x,z), so Lemma [6.71 applies to the triple (z,y,x). 
If h(\ca(x,y)) < h(\ca(x,z)) then \ca(x,y) = \ca(z,y), and so h(\ca(x, z)) > h(\ca(z,y)). Hence 
Lemma [6761 applies to either the triple (x,y,z) or the triple (z,y,x). □ 
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We end this subsection with a short discussion on the distance between tent-type and path-type 
configurations. It turns out that when e/, <C 5, a <5-midpoint configuration (x, y, z) can be close 
to a path-type configuration, and at the same time the reversed triple (z, y, x) close to a tent-type 
configuration (or vice versa). However, it is easy to see that this is the only "closeness" possible. 

Lemma 6.8. Fix x,y,z € -Boo with x ^ y. Then the following statements are impossible: 

1. (x,y,z) is ^d £ (x,y)-near a path-type configuration and a tent-type configuration. 

2. (x, y, z) is j^d £ (x, y)-near a path-type configuration and (z, y, x) is jfd £ (x, y)-near a path-type 
configuration. 

3. (x,y,z) is jjd £ (x,y)-near a tent-type configuration and (z,y,x) is j^d £ (x,y)-near a tent-type 
configuration. 

Proof. For case 1 of Lemma 16.81 assume for contradiction that (x,y,z) is |d e (x,y)-near a path- 
type configuration (oi,6i,ci) 3 and also ^d e (x, y)-near a tent-type configuration (ai,/3i,7i). By the 
definitions of path-type and tent-type configurations, a\ is a descendant of b\ and f3\ is a descendant 
of a\. Hence, 

3 

h(ai) - h(h) = d e (ai,h) ^ d £ (x,y) - d £ (x,ai) - d e (y,bi) ^ -d £ (x,y), (59) 

5 

and 

3 

h(fii) - h(ai) = de(ax,Pi) ^ d £ (x,y) - d £ (x,a 1 ) - d E {y,f}\) ^ -d £ (x,y). (60) 

5 

By summing (|59|) and (|60p we see that, 
4 

-d e (x,y) ^ d £ (ai,x) + d e (x,a>i) + d £ (bi,y) + d £ {y,j3 1 ) ^ d e {ai,a{) + d £ (b 1 ,fi 1 ) 
5 

> h(ai) - h(ai) + hfa) - h(bi) ^ -d £ {x,y), 

5 

a contradiction. 

For case 2 of Lemma 16.81 assume for contradiction that (x,y,z) is jjd £ (x, y)-near a path-type 
configuration (02,62,62), and also (z,y,x) is jjd £ (x, y)-near a path-type configuration {a. 2 , fh,"]?)- 
By the definitions of path-type and tent-type configurations, a 2 is a descendant of 62 and h(f3 2 ) > 
h{^ 2 ). Hence, 

9 

h(a 2 ) - h(b 2 ) = d £ (a 2 ,b 2 ) > d E (x,y) - d £ (x,a 2 ) - d £ (y,b 2 ) > — d £ (x,y), (61) 

and 

h(h) - h( l2 ) + 2e h{l2) [h{ l2 ) - Mlca(/3 2 ,72))] = 4(^2,72) 

9 

> d £ (x,y) - 4(^,72) - d £ (y,(3 2 ) ^ —d £ (x,y). (62) 
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By summing (|61j) and (|62[) we see that 
17 

— d £ (x,y) ^ 4(a2,x) + 4(2,72) + d £ (b 2 ,y) + d £ (y,/3 2 ) + d £ (f3 2 ,y) + d £ (x,y) + d £ (x,j 2 ) 

> d £ (a 2 , 72) + de(b2,fo) +4(^2,72) 

> (% 2 )-M72)) + (M^)-M&2)) +2 £ft(72) [/ l ( 72 )-^(lca(/3 2 , 72 ))] 

> ^4(^,2/), 

a contradiction. 

For case 3 of Lemma HTSl assume for contradiction that (x,y,z) is j^d £ (x, y)-near a tent-type 
configuration (03,63,03), and also (z,y,x) is Yj-(f e (x, y)-near a tent-type configuration (03, {3 3 , 73). 
Then 63 is a descendant of 03 and ^(73) > h((3 3 ). Hence, 

9 

h(b 3 ) - h(a 3 ) = d £ (a 3 ,b 3 ) ^ d £ (x,y) - d £ (x,a 3 ) - d £ (y,b 3 ) ^ — d £ (x,y), (63) 

and 

K73) ~ h(p 3 ) + 2e m) [h(fi 3 ) - h(\ca((3 3 , l3 ))} = d £ (/3 3 , l3 ) 

9 

> d £ (x,y) - d E (x,j 3 ) - d £ (y,(3 3 ) ^ — d £ (x,y). (64) 

Hence, 
17 

—d e (x,y) > d £ (b 3 ,y) + 4(y, /3 3 ) + d e (x, a 3 ) + d £ (x,^ 3 ) + d £ (f3 3 ,y) + d £ (x,y) + d £ (x,^ 3 ) 

> d £ {b 3 ,(3 3 ) + d £ (a 3 ,j3) + d £ (/3 3 , l3 ) 

Z (h(b 3 ) - h((5 3 )) + (h( l3 ) - + 2e hm [h(f3 3 ) - h(\ca((3 3 , l3 ))} 

|[63jAl|6U 18 

^ — 4(a,2/), 

a contradiction. □ 
6.2.2 Classification of approximate forks 

We begin with three "stitching lemmas" that roughly say that given three points x, x' , y € (Boo, d £ ) 
such that x' is near x, there exists y' near y such that d £ (x',y') is close to d £ (x,y), and y' relates 
to x' in Boo "in the same way" that y relates x in Boo- 

Lemma 6.9. Let x,x',y,y' € Boo be such that y is an ancestor of x, and y' is an ancestor of x' 
satisfying h(x) — h(y) = h(x') — h(y'). Then d £ (y,y') ^ d £ (x,x'). 

Proof. Assume without loss of generality that h(x) ^ h(x'). So, 

d £ (x,x') = h(x) — h(x') + 2e fe ( x /)[/i(x') — h(\ca(x, x'))\. 

Note that h(\ca(y,y')) = mm{h(y'),h(\ca(x,x'))}. Hence, 

d £ (y,y') = h(y) - h(y') + 2e h(yl) [h(y') - h(\ca(y,y'))] 

= h(x) -h(x') + 2e h(y/) [h(y') - mm{h(y'),h(\ca(x,x'))}} 

= h(x) — h(x') + 2e h ( y i-} max{0, h(y') — h(\ca(x, x'))}. (65) 
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If the maximum in (|65p is 0, then 

d £ (y,y') = h(x) - h(x') ^ d £ (x,x'). 
If the maximum in ([65]) equals h(y') — h(\ca(x,x')), then 

d E (y,y') = h(x) - h(x') + 2e h{yl) [h{y') - h(\ca(x,x'))\ 

«S h(x) -h(x') + 2e h(xl) [h(x') - h(\ca(x,x'))} = d £ (x,x'), (66) 

where in ([66|) we used the fact that the sequence {s n {n — a)}^ =0 is nondecreasing for all a ^ 0. □ 

Lemma 6.10. Let x,x',y £ B^ be such that h{y) ^ h(x). Then there exists y' G B^ which 
satisfies h{y') — h(x') = h{y) — h{x), 

d e (y,y')^d £ (x,x'), (67) 

and 

d £ (x,y) -2d £ (x,x') ^ d £ {y ,x) ^ d £ (x,y) +2d £ (x,x'). (68) 

Proof. Note that (f68|) follows from (|67p by the triangle inequality. Assume first that h(x) ^ h(x'). 
In this case choose y' to be an ancestor of y satisfying h(y) — h(y') = h(x) — h(x'). Then, 

d £ (y,y') = h(y) - h(y') = h(x) - h(x') ^ d £ (x,x'). 

We next assume that h{x) < h{x'). If h(\ca(x, x')) ^ h(\ca(x,y)) then choose y' to be an 
arbitrary descendant of y such that h(y') — h(y) = h{x') — h(x). As before, we conclude that 
d s (y,y') = h(y') - h(y) = h(x') - h(x) d £ (x,x'). 

It remains to deal with the case h{x') > h{x) and h(\ca(x,y)) = h(\ca(x', x)), which also implies 
that h(\ca(x' , y)) > h(\ca(x,y)). In this case, we choose y' to be an arbitrary point on a branch 
containing both lca(x,y) and x, such that h(y') — h{y) = h{x') — h{x). Then \ca(y',y) = lca(x,x'), 
and therefore, 

d £ (y,y) = h(y')-h(y)+2e h{y) [h(y)-h(\ca(y,y')} 
= h(x') - h(x) + 2e h{y) [h(y) - h(\ca(x,x'))} 
^ h(x') — h(x) + 2e h ^[h(x) — h(\ca(x, x'))] 
= d £ (x,x'), 

proving (|67p in the last remaining case. □ 

Lemma 6.11. Let x,x',y £ B^ be such that y is a descendant of x. Then for any y' £ B^ which 
is a descendant of x' and satisfying h(y') — h(x') = h(y) — h(x), we have 

d £ {y,y') ^ d £ (x,x') + 2e min{hiy/)My)} [h(y) - h(x)} ^ d £ (x,x') + 2e h(y) [h{y) - h(x) + d £ (x,x')}. 

Proof. Note that h(\ca(y,y')) ^ h(\ca(x, x')). Assume first that h(x') ^ h(x). Then, 

d £ (y,y') = h(y') - h(y) + 2e h(y) [h(y) - h(\ca(y,y'))] 

= h(x') - h(x) + 2e h{y) [h{x) - h(\ca(y,y'))] + 2e h(y) [h{y) - h(x)} 
^ h(x') - h(x) +2e h(y) [h(x) - h(\ca(x,x')] + 2e h{y) [h(y) - h(x)} 
^ d £ (x,x') + 2e h ( y - ) [h(y) - h(x)]. 
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When h(x') < h(x), we similarly obtain the bound: 

d £ (y,y') = h(y)-h(y') + 2e h{yl) [h(y')-h(\c 3 (y,y'))] 

= h(x) - h(x') + 2e h{yl) [h(x') - h(\c 3 (y, y'))} + 2e h{yl) [h(y') - h(x')] 
< d £ (x,x') + 2e h ^[h(y)-h(x)]. 

The last inequality in the statement of Lemma [6. Ill is proved by observing that when h(y') < h(y), 

£h{ y i)[h{y) - K x )\ = £h( y i)[Hy') - h( x ')} < e h{ y )[Kv) - K x ')\ < £ h( y )[h(y) - K x ) + d E (x,x')]. □ 

Definition 6.12. For 5 € (0,1) and x,y,z,w € -Boo, the quadruple (x,y,z,w) is called a 5 -fork, if 

y G Mid(x, z, 5) D Mid(x, w, 5). 

5-forks in H-trees can be approximately classified using the approximate classification of mid- 
point configurations of Section 16.2.11 We have four types of midpoint configurations (recall Fig- 
ure [3]): 

• path-type; denoted (P) in what follows, 

• reverse path-type; denoted (p) — (x,y,z) is of type (p) iff (z,y,x) is of type (P), 

• tent-type; denoted (T), 

• reverse tent-type; denoted (t) — (x,y,z) is of type (t) iff (z,y,x) is of type (T). 

Thus, there are (g) = 10 possible 5- fork configurations in (B^jd^ (choose two out of the five 
symbols "P" , "p" , "T" , "t" , "X" , where "X" means "the same" ). As we shall see, four of these possible 
configurations are impossible, two of them have large contraction of the prongs of the forks, i.e., 
d e (z,w) d £ (x,y), which immediately implies large distortion, and the rest of the configurations 
are problematic in the sense that they are not much distorted from the star (the metric d on 
four points p, q, r, s given by d(p, q) = d(q, r) = d(q, s) = 1 and d(p, s) = d{p, r) = d(r, s) =2). The 
10 possible 5- fork configurations are summarized in Table [TJ 



Midpoint configuration 


Type 


(T||T) 


Type I 


(P||P) 


Type II 


(p||T) 


Type /// 


(Pllt) 


Type IV 


CpIIp) 


prongs contracted 


(tilt) 


prongs contracted 


(p||p) 


impossible 


(P||t) 


possible only as approximate type iT 


(P||T) 


impossible 


(t||T) 


impossible 



Table 1: The ten possible fork configurations. 



For future reference, we give names to the four problematic configurations: 
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Definition 6.13. Forr],5 £ (0,1), a 5 -fork (x,y,z,w) of (-Boo,(f e ) is called 

• r]-near Type I (configuration (T\\T)) in TableUl), if both (x,y,z) and (x,y,w) are n-near 
tent-type configurations; 

• rj-near Type II (configuration (P\\P) in Table [7p, if both (x,y,z) and (x,y,w) are rj-near 
path-type configurations; 

• r]-near Type III (configuration (p\\T) in TableU^, if (z,y,x) is rj-near a path-type configura- 
tion and (cc, y, w) is rj-near a tent-type configuration, or vice versa; 

• rj-near Type IV (configuration (p\\t) in Tabled), if(z,y,x) is rj-near a path-type configuration 
and (w, y, x) is rj-near a tent-type configuration, or vice versa. 

A schematic description of the four problematic configurations is contained in Figured! 




Type / Type II Type III Type IV 

Figure 4: The four "problematic" types of 5- forks. 



The following lemma is the main result of this section. 

Lemma 6.14. Fix 5 G (0, j^j and assume that e n < j for all n E N. If (x,y,z,w) is a 5- 
fork of (Bocdg) then either it is 355d £ (x,y)-near one of the types I, II, III, IV, or we have 
d £ (z,w) ^ 2(355 + £f l0 )d £ (x, y), where ho = min{h(x), h(y), h(z), h(w)}. 

Remark 6.15. One can strengthen the statement of Lemma 16.141 so that in the first case the fork 
(x,y,z,w) is 0(5d e (x,y)) near another fork (x' , y' , z' , w') which is of (i.e. 0-near) one of the types 
/, II, III, IV. This statement is more complicated to prove, and since we do not actually need it 
in what follows, we opted to use a weaker property which suffices for our purposes, yet simplifies 
(the already quite involved) proof. 

The proof of Lemma l6. 141 proceeds by checking that the cases marked in Table [1] as "impossible" 
or "prongs contracted" are indeed so — see Figure [5] for a schematic description of the latter case. 
We begin with the (p\\p) configuration. 

Lemma 6.16. Let (x,y,z,w) be a 5-fork of (Boo, d £ ) and assume that both (z,y,x), and (w,y,x) 
are rjd £ (x,y)-near path-type configurations. Then, assuming that max{<5, 77} < 1/8 and e n < \ for 
all n, we have 

d £ (z,w) ^ (9rj + 65 + 2s h ( y) ) d £ (x,y). 
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Figure 5: The two configurations of 5 -forks with large contraction of the prongs. 



Proof. Let (z 1 , y', x') be a path-type configuration that is rjd E (x, y)-near (z, y, x), and let (uu", y" , x") 
be a path-type configuration that is r]d e (x, y)-near (w, y, x). Without loss of generality assume that 
h(y") ^ h(y'). Let w' be the descendant of y' satisfying h(w') — h{y') = h(w") — h(y") such that 
w' is either an ancestor or an arbitrary descendant of z' . Note that h(w') ^ h{y). Indeed, 

h(w') = h(y') + h{w") - h{y") = h(y') + d E (w", y") > h(y) - \h(y) - h{y')\ + d £ (w, y) - 2r,d E (x, y) 

> h(y) - d £ (y,y') + ^-^d E (x,y) - 2r)d E (x,y) ^ h(y) + (j-^ ~ 3r ^j d e( x 'V) > %)■ 

By Lemma |6.11[ 

d £ (w',w") d E (y',y") + 2e h ^ [h (w") - h (y")] ^ d £ (y',y") + 2e h ^d E (y", w") 

^ 2rjd E (x, y) + 2s h{y) ^2r? + d £ (x, y). (69) 

Observe that 

(j^j ~ 2r ^j d e{x,y) ^ d £ (z,y) - 2rjd £ (x,y) < d £ (z',y') 

< d £ (z,y) + 2r]d £ (x,y) < + 2r/J d E (x, y), (70) 

Since d £ (w',y') = d E (w",y"), we obtain similarly the bounds: 

[ ' ~ 'I 2 V ) d E (x, y) < d E (w', y') ^ d £ (z, y) + 2 V d £ (x, y) ^ + 2ri) d £ (x, y). (71) 

Hence 



4(*>') = |4(?/V) - d E (y',w')\ < f ^—^ + 4?? J (72) 



So, in conclusion, 

d £ (z, w) ^ d e (z, -z') + d £ (w , u/') + d £ (w" , w') + ^(z', u; ) 



■ a (22) / 4^ / i _i_ J\ \ 

^ { 8f l + + 2 \ 2rj + ~ - 5 ) e Ky)) d e{x,y) < (9»7 + 6<J + 2e M!/) )d e (x,y). □ 
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We next consider the (t||t) configuration. 

Lemma 6.17. Let (x,y,z,w) be a 5-fork of (Boo,d e ). Assume that both (z,y,x) and (w,y,x) are 
nd £ (x,y)-near tent-type configurations. Then, assuming that max{ 5, 77} < 1/4, we have 

d e (z,w) < (8?? + 55)d £ (x,y). 

Proof. Let [z' , y', x') be a tent-type configuration that is nd £ (x, y)-near (z, y, x), and let (w", y" , x") 
be a tent-type configuration that is nd £ (x, y)-near (w, y, x). Assume without loss of generality that 
h(y") — h(w") h(y') — h(z'). Let w be a point on the path between w" and y" such that 
h\y") - h{w) = h(y') - h(z'). Then, 



d £ (w",w) = h(y") - h(w") - (h(y') - h(z')) = d e (y",w") - d £ (y',z') 



s% d £ (y,w) - d e (y,z) + 4r?4(x,y) < ( ^-t^ - + 4r? ) d E (x,y). (73) 



1-5 l + <5 

By Lemma lfT9l we have d £ (w,z') ^ d £ (y',y") ^ 2rjd £ (x,y). Hence we conclude that 

d e (y,z) < 4(^,2') +4(^,^0 + d £ (w,w") + d £ (w",w) < ( - _ ^ 2 + 8r? j d £ (x,y). □ 

Lemma 6.18. Lei (x, y, z, w) be a 5-fork of -Boo- Assume that (x, y, z) is r/d £ (x, y)-near a path-type 
configuration. Assume also that 5 < 1/30, n < 1/10, and e n < 1/4 for all n. Then (x,y,w) is 
(2rj + 215) d £ (x,y) -near a path-type configuration, i.e., (x,y,z,w) is (2ry + 215)d £ (x, y)-near a type 
II configuration. 



Proof. Let (x' , y' , z') be a path-type configuration which is rjd £ (x, y)-near (x, y, z). By Theorem l6.4l 
either (x,y,w) or (w,y,x) must be 3<5d e (x,z) ^ Y^d e (x,y) ^ 75d £ (x, y)-near either a path-type 
configuration or a tent-type configuration. 

Suppose first that (x, y, w) is 75d £ (x, y)-near a tent-type configuration (x", y" , it/'). In this case, 
x" is an ancestor of y" and fr(y") — h(x") = d £ (x",y") ^ (1 — 14<5)d e (x, y). At the same time, y' is 
an ancestor of x' and /i(x') — /i(y') = d £ (x',y') ^ (1 — 2r])d £ (x,y). So, 

2(t? + 7(5)4(x, y) ^ d £ (y", y') + d e (x', x") ^ %") - %") + h(x') - h{y') > 2(1 - r, - 75)d £ (x, y), 

which is a contradiction since 77 + 75 < 1/2. 

Next suppose that (u>,y,x) is 75d £ (x, y)-near a path-type configuration (w" ,y" , x"). Then 
|/i(x') -fc(x")| ^ 4(x',x") ^ (77 + 7(5)4 (x,y). So, 

(r/ + 7(5)4 (x,y) ^ d £ (y',y") ^ /i(y") - fr(y') 
= (%") - /i(x")) + (h(x") - h(x')) + ft(x') - %') > - (t? + 7,5)4^, y) + (1 - 2t/)4(x, y), 

which is a contradiction 

Lastly, suppose that (u>, y, x) is 75d £ (x, y)-near a tent-type configuration (u>", y" ,x"). Note that 
1%') -%") I < 4(y',y") < (v + 7S)d £ (x,y). So, h(y') > h(y") - (77 + 75)d £ (x,y). Also, 

%") - /*K) = 4(y" V) ^ 4(y,^)- 14(54 (x,y) > (j^-14s\ d £ (x,y) > (7? + 75)d £ (x,y). 
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Consider the point w denned as the ancestor of y' at distance h(y") — h(w") — (r] + 75)d e (x, y) from 
y' . Let also w'" be the ancestor of y" at distance h(y") — h(w") — (rj + 75)d £ (x,y) from y" . By 
Lemma 16.91 we have d £ (w,w"') ^ d £ (y',y") ^ (rj + 75)d £ (x, y). Therefore, 

d £ (w,w) ^ d £ (w,w m ) + d £ (w"' ,w") + d £ (w" ,w) ^ (2rj + 215)d E (x, y). 

Hence (x,y,w) is (2-q + 215)d E (x, y)-near the path-type configuration (x',y',w). □ 

Lemma 6.19. Let (x,y,z,w) be a 5-fork of B^. Assume that (x,y,z) is rjd £ (x,y)-near a tent- 
type configuration. Assume also that n < 1/10 and e ri < 1/4 for all n. Then (w,y,x) cannot be 
rjd £ (x,y)-near a tent-type configuration. 

Proof. Let (V, y', z') be a tent type configuration that is r]d £ (x, y)-near (x, y, z). Suppose for contra- 
diction that there exists a tent type configuration (w" ,y" , x") that is rjd £ (x, y)-near (w,y,x). Note 
that h{y") > h(y') - d £ (y', y") > h(y') - 2 V d £ (x, y) and h(y') - h(x') > (1 - 2 V )d £ (x, y) > 2rjd £ (x, y). 
Let x* be the ancestor of y' at distance h(y') — h(x') — 2rjd £ (x,y) from y', and let x be the an- 
cestor of y" at distance /i(y') — h{x') — 2rjd £ (x,y) from y" . An application of Lemma 16.91 yields 
the estimate d £ (x,x*) ^ d £ (y',y") ^ 2rjd £ (x,y). But, since ^ h(y"), we also know that 

d £ (x,x") ^ /i(y") - /i(x) = d £ (y',x') - 2rjd £ (x,y). Hence, 

2rjd £ (x,y) ^ 4(£,^*) ^ d e (x, x") - d £ (x* , x') -d £ (x', x") ^ d £ (x ,y') -6rjd £ (x,y) ^ (1 -8rj)d £ (x, y), 

which is a contradiction, since rj < 1/10. □ 

Proof of Lemma \6. 14\ Since (x,y,z,w) is a (5-fork, by Theorem 16.41 both (x,y,z) and (x,y,w) are 
75d £ (x, y)-near a tent-type configuration, a path-type configuration, or the corresponding reverse 
configurations. We have 10 possible combinations of these pairs, as appearing in Table HJ By 
applying Lemmas 16.181 and 16.191 with rj = 75, we rule out three of these configurations, and a fourth 
configuration is possible but only as 355d £ (x, y)-near a type II configuration. 

We are left with six possible configurations. By applying Lemmas 16.161 and 16.171 with rj = 75 
we conclude that in two of those configurations we have d £ (w,z) ^ (695 + 2eh )d £ (x,y), and the 
rest are configurations that are 75d £ (x, y)-near one of the types I—IV. □ 

6.2.3 Classification of approximate 3-paths 

We start with the following natural notion: 

Definition 6.20. For xq, x\, X2, X3 G the quadruple (xq, x%, X2, £3) is called a (1+5) -approximate 
P3 if there exists L > such that for every ^ i ^ j ^ 3 we have 

(j - i)L ^ d £ (xi,Xj) «C (1 + 5)(j - i)L. 

Note that in this case x\ G Mid(xo, X2, 5) and X2 G Mid(xi, X3, 5). 

As in the case of <5-forks, there are 10 possible concatenations of two midpoints configurations 
(path-type or tent-type): P-P, P-p, P-T, P-t, p-P, p-T, p-t, T-T, T-t, t-T (the midpoint config- 
urations p-p, P-p, t-p, T-p, p-P, t-P, T-P, t-t, T-t, t-T are respectively such concatenations with 
the order of xo,xi,X2,X3 reversed). We will rule out some of these possibilities, and obtain some 
stronger properties for the rest. See Tabled 

As in the case of <5-forks, it will be beneficial to give names to three special types approximate 
3-paths: 
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Midpoint configuration Reverse configuration Type 



(P-P) 

(P-P) 
(P-T) 

(P-t) 

(P-P) 
(P-T) 

(P-t) 
(T-T) 
(T-t) 
(t-T) 



(P-P) 
(P-P) 
(t-P) 
(T-P) 

(P-P) 
(t-P) 

(T-P) 

(t-t) 

(T-t) 

(t-T) 



type A 



impossible 
impossible 
type B 



impossible 
type C 



impossible 

possible only as type C 



impossible 
impossible 



Table 2: The possible configurations of 3 paths. 



Definition 6.21. For xq, x±, X2, X3 £ B^ and n > 0, a quadruple (xo, x±, X2, X3) is called: 

• n-near a type A configuration if both (xq,xi,X2) and {x\ , x^. , #3) are 77 -near path-type config- 
urations, 

• r]-near a type B configuration if {xq, xi,x%) is rj-near a path-type configuration, and (2:3, X2, X\) 
is r\-near tent-type configuration, 

• 77 near type C configuration if (x2, xi, xq) is rj-near a path-type configuration, and {x\,X2,x^) 
is rj-near a tent-type configuration. 

See also Figured 



The following lemma is the main result of this subsection. 

Lemma 6.22. Assume that e n < \ for all n and fix 5 < 1/200. Assume that (xq, x\, X2, X3) 
is a (1 + 5) -approximate P3. Then either (xq,xi,X2,x^) or (x^,X2,x\,xq) is 355d £ (xQ,xi)-near a 
configuration of type A, B or C. 




Type A 



Type B 



TypeC 



Figure 6: The three possible types of approximate 3-paths. 
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The proof of Lemma 16.221 is again a case analysis that examines all 10 possible ways (up to 
symmetry) to concatenate two midpoint configurations. The proof is divided into a few lemmas 
according to the cases, and is completed at the end of this subsection. 

Lemma 6.23. Assume that e n < \ for all n and that (xo, xi, X2, X3) is a (1 + 5) -approximate P 3 
such that (xq,xi,X2) is rjd e (xo, x x )-near a path-type configuration. Ifm&x{5, r/} < 1/200 then either 
(xi,X2,xs) is 7Sd £ (xo,xi)-near a path-type configuration (type A), or (x 3 ,X2,xi) is 75d £ (xQ, x x )- 
near a tent-type configuration (type B). 

Proof. Due to Theorem 16.41 we only need to rule out the possibility that (x 3 , X2, x x ) is 78d e (x x ,X2)- 
near a path-type configuration, or that (xi,X2, X3) is 78d e {x\, a^-near a tent-type configuration. 
Let (x' ,Xi,x 2 ) be a path-type configuration that is 7/4 (xq, x\ )-near (xo, xi, X2). 

Suppose first that (x 3 , X2, x x ) is 78d £ (x x , X2)-near the path-type configuration (x 3 ', x 2 , x'{). Since 
h(xi) h(x 2 ) and h(x 2 ) ^ h(x'{) we have, 

\h{x x ) - h(x 2 )\ ^ \h( Xl ) - h{x' x )\ + [h{x' x ) - h{x' 2 )) + \h{x' 2 ) - h(x 2 )\ 

^ d £ (x x ,x' x ) + (h{x[) - h(x' 2 )) + d £ (x' 2 ,x 2 ) ^ 2r)d £ {x ,x 1 ) + (h{x[) - h(x' 2 )) , (74) 

and similarly, 

\h(x x ) - h(x 2 )\ ^ \h(x x ) - h{x'{)\ + (h(x' 2 r ) - h(x'D) + \h(x' 2 r ) - h(x 2 )\ 

^ d £ (x u x'() + (h(4) - h{x'{)) + 4(4, x 2 ) ^ 14(54(xo, xi) + {h(x' 2 ') - h{x'{)) . (75) 

By summing (f?4"|) and (|75p we obtain the bound 

2\h{x x ) - h(x 2 )\ ^ (2r ] + U8)d £ (xo,x 1 ) + d £ (x' x ,x'{) + d £ (x' 2 ,x' 2 ') < (4r/ + 285) 4(x , x x ). 

Thus 

\h(x x ) - h(x 2 )\ ^ {2 V + U8)d £ (x ,x x ). (76) 
Since descendant of x^ , 

\h(x ) - h(x x ) - d £ (xo,x x )\ ^ \h(x' ) - h(x[) - d £ (x ,x x )\ + 2r]d £ (x ,x x ) 

= \d £ (x' Q ,x' x ) - d £ (x ,x x )\ + 28d £ (x ,x 1 ) ^ 4r/4(x ,xi). (77) 

Similarly, since x% is a descendant of x 2 , 

\h(x 3 ) - h(x 2 ) - 4(x ,xi)| ^ \h{x'z) - h(x 2 ) - 4(x ,xi)| + U5d £ (x x ,x 2 ) 

= \d £ (xz,x 2 ) -4(x ,xi)| + 14<5d e (x ,xi) ^ 28^4(x ,xi). (78) 

Hence, 

\h(x'z) - h(x' )\ < |/i(xg) - h(x 3 )\ + |/i(x 3 ) - /i(x 2 ) - 4(^0, + \h(x 2 ) - h(x x )\ 
+ \h(x ) - h(x x ) - d £ (xo,x x )\ + \h(x ) - h(x' Q )\ 

(f76l A(f77lAj78l 

< 4(x 3 ',x 3 ) + 288d £ (x ,x x ) + (2t] + U5)d £ (x ,x x ) + Arjd £ (x ,x x ) + d £ (x ,x' ) 
^ (498 + 7rj)d £ (xo,x x ). (79) 
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We record for future reference the following consequence of (j76|) and ([79]) : 

mm{h(x' ) , h(x'^)} — min{/i(x' 1 ), /i(x 2 ')} ^ max {h(x' ) — h(xi), h(x' ) — h^x^)} 

ll79t 

< max{d E (x' ,x' 1 ),h(x'^) - hix'^) + (495 + 7rj)d £ (x , xi)} 

^ max { (1 + 2rj) d e (x , x x ) , 4 (x 3 ' 5 4' ) + (495 + 7rj) d e (x , x x ) } 

< (1 + 645 + 7t7)4(x ,x 1 ). (80) 
We next claim that 

lca(x^x^) = lca(x / 1 ,x / 2 '). (81) 

Indeed, since x[ is an ancestor of x' and x 2 ' is an ancestor of x 3 ', if \ca(x' ,x'^) ^ Ica^'^x^) then 
either x[ is a descendant of X<2 1 Or X2 IS cl descendant of x[. If x[ is a descendant of x 2 ' then 

(77 + 75)d e {xQ,x\) ^ 4(^1,2:1) ^ 4(^2^1) > 4(^2, xi) - (77 + 75)d £ (x ,x 1 ) 

> - r —=d e (x ,x 1 ) - (77 + 75)4(x ,Xi), 

which is a contradiction since 5, 77 < 1/200. Similarly, if descendant of x\ then 

(7/ + 75)4(x ,xi) > 404^2) > 404'> aJ l) ^ 7ti4(io^i) - (v + 7S)d £ (x , xi), 

l + o 

arriving once more at a contradiction. This proves (|8ip . 
Now, 

3 

4(x ,Xi) < 4(x ,X3) 



1 + 5 



^ 4(^3^0) + + 7(5)4(x ,xi) 

ll79t 

< 2e m i n {? l (a : y,ft(x^)} [min{M4)>M4')} - h(\ca(x' , a#))] + (877 + 565)4 (x , x x ) 

® ^mm{h(x' () ),h(^)} [ min iH x i)i h ( x 2)} ~ ^(lca(x / 1 ,x / 2 / ))] + (8?? + 565)d E (x , Xi) 
+ ^mm{h(x' ),h(^)} [min{/i(x' ),/i(x3')} - min{/i(x / 1 ),/7(x / 2)}] 

ll80t 

«S 2e m in{/ l ( a: i)^(4')} [^^{^(a?!)^^)} - h(\ca(xi,X2))] 

+ (sr? + 565 + 1 + 64 * + 7 ^ 4(x , xx) (82) 

^ 4(^1,40 + Q + 885 + 12? ?) 4(^o, xi) 

^ Q + 965 + 13^ 4(^o, (83) 

where in ([82]) we used min{h(x' Q ) , h(x'^)} ^ min{/i(x' 1 ), /i(x 2 ')} and £ m m{h(x' ),h(x'^)} < 1/4- Since 
max{77, 5} < 1/200, the bound (|83[) is a contradiction. 
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Next suppose that (xi,X2,X3) is 7<54(xo,xi)-near a tent-type configuration (x'{, x 2 , x 3 '). Since 
h{x' 2 ) ^ M^'i) and h(x 2 ) ^ h(x'{), we have 

\h( Xl ) - h(x 2 )\ ^ \h( Xl ) - h(x[)\ + (h(x[) - h(x' 2 )) + \h(x' 2 ) - h(x 2 )\ 

< de(xi,x{) + (/l^) - h{ X ' 2 )) + - fr(x?)) + d £ (x' 2 ,X 2 ) 

4 (xi ,x[) + d £ (x[ , x") + d £ (x' 2 , x 2 ') + d £ (x 2 , x 2 ) 
^ {4r ] + US)d £ {x ,x 1 ). (84) 

On the other hand, x'( is an ancestor of x 2 , and therefore we have 

(j^j - 145) 4(^0^1) < 4K,40 = - MO 

ll84t 

^ -h(x 2 )\ + 14<54(x , xi) ^ (477 + 28<5)4(x , x x ), (85) 

which is a contradiction since max{r/, 5} < 1/200. □ 

Lemma 6.24. Assume that £ n < \ for all n and that (xo, x%, x 2 , X3) is a (1 + 5) -approximate 
P3 suc/i i/iai (x2,xi,Xo) is nd £ (xo,xi)-near a path-type configuration. If max{(5, 77} < 1/200 then 
either (xz,x 2 ,xi) is 75d £ (xo,xi)-near a path-type configuration (reverse type A), or (xi,x 2 ,X3) is 
75d £ (xQ,xi)-near a tent-type configuration (type C). 

Proof. Let (x^x'^Xq) be in path-type configuration that is nd £ (xo, xi)-near (x2,xi,xo). First, 
assume for contradiction that (x3,X2,xi) is 7<54(xi, X2)-near a tent-type configuration (x'g, x 2 , x'{). 
Then h{x'{) ^ h(x 2 ), where as h(x 2 ) — h(x' 1 ) = 4(x' 2) x / 1 ). Arguing as in (j84"|) . it follows that 
\h(xi) — h(x 2 )\ ^ (277 + 28(5)4 (xo , xi), and we arrive at a contradiction by arguing similarly to ([85]) . 

Next, assume for contradiction that (xi,X2,X3) is 7<54(xi, X2)-near a path-type configuration 
(x'{, x 2 , x 3 '). Then h(x'{) — h(x 2 ) = d £ {x'[,x 2 ), whereas h(x 2 ) — h(x' 1 ) = d £ (x' 1 ,x 2 ). By summing 
these two identities, we arrive at a contradiction as follows: 

(t^ - 2rj - 14^ 4(x ,xi) ^ 4(x' 1 ,x / 2 ) + 4(x' 1 , ,x 2 ') = {h{x' 2 ) - h(a%)) + {h(x'{) - h(x[)) 

< 4(x / 2 ,x 2 ') + 4(x / 1 ,x / /) sC (2?7 + 14J)4(xo,xi). □ 

Lemma 6.25. Assume that e n < 4 for all n and that (xq,xi,x 2 ,xs) is a (1 + 6) -approximate P3 
such that (xo,xi,X2) is nd £ (xo,xi)-near a tent-type configuration. 7/max{<5, 77} < 1/200 then either 
(x2,xi,xo) is (145 + 3t7)4(xo, x\)-near a path-type configuration and (xi,X2,X3) is 75d £ (xi, x 2 )- 
near a tent-type configuration (type C ), or (X3, X2, xi) is 75d £ (xo, x\)-near a path-type configuration 
(reverse type B). 

Proof. Let (x^x'^x^) be a tent-type configuration that is nd £ (xo, xi)-near (xo,xi,X2). First, 
suppose that (xi,X2,X3) is 75d £ (xi, x 2 )-ne&r a tent-type configuration (x'{, x 2 , x 3 '). Note that 
\h(xi) — h(x'{)\ ^ d £ (x' 1 ,x'{) ^ (77 + 7<5)4(xo, xi). So, let x'q be an ancestor of x'{ at distance 
^(x^) — h(x' Q ) — (77 + 75)4(xo ; xi) € [0, h{x'l)} from x", and let Xq be an ancestor of x[ at dis- 
tance h(x' 1 ) — h(x' ) — (77 + 75)4(xo,xi) from x\. Then h(x[) — /i(xq) = h{x'{) — /i(x'q) and 
4(xo,x' ) ^ (77 + 75)4(xo> xi). By Lemma [6T9l 

4(x ,Xo) - (277 + 75)4 (xo, xi ^ 4(x ,Xo) - 4(xq,x' ) - 4(xq,x ) ^ 4(x$,x / ) 

^ 4(x' 1 ,x") ^ (77 + 75)4 (x ,xi). 
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Hence (x 2 ,x'{,x'q) is a path-type configuration that is (145 + 3r/)d £ (xo, xi)-near (x2, x\, xq). 

Next assume for contradiction that (x3,X2,xi) is 75d £ (xi, X2) near a tent-type configuration 
(x' 3 ' , x'2 , a?i ) • Then 

(1 - 155) d E (x ,xi) ^ - 145^ d e (x , Xl ) ^ h(4) - h(a%) < (1 + 155K(x ,xi), (86) 

and 

{l-8-2ri)d e {x Q ,xx) < (^-^ - 2r^ d £ (x , Xl ) ^ h(x[) - h(x' ) < (1 + 5 + 2 v )d £ (x , x x ). (87) 

So, let x** be an ancestor of x 2 at distance fo(x 2 ) — h(x'l) — (165 + 2rj) d £ (xQ, x\) G [0, ^(x^)] from 
x 2 , and let Xq be an ancestor of x[ at distance /i(x 2 ) — h(x 3 ) — (165 + 2r/) d e (xo,xi) 6 [0, /i(xi)] 
from x^. Then 

d £ (x 3 ',xf # ) ^ (165 + 277)4(x ,a?i), (88) 

and 

d e (x' ,x*) = \h(x' ) - h(xf)\ = \h(x' ) - (h(x[) - h{x'i) + h(4) + (165 + 2r,)d s {x ,xx))\ 

ll86tAl87t 

< 2(165 + 2rj)d e (x ,xi). (89) 
Moreover, h(xi) — h(x^) = /i(x 2 ) — h(xf ), so by Lemma 16.91 we have 

l88tAll89t 



1 + 5 



555 -7r] J d £ (x ,x 1 ) ^ d £ (x ,x 3 ) - (555 + 7r])d £ (x ,xi) < d £ (x^,x^ w ) 

< d £ (xi,X2) ^ (1 + 85 + rj)d e (xQ,xi) 



which is a contradiction since max{5, rj] < 1/200. 

Lastly, assume for contradiction that {x\, X2, x$) is 75d £ (x\, X2)-near a path-type configuration 
(x'{, x'2, x 3 '). Then since ^(x^) ^ h(x' 2 ) we have 

iTS ~ U5 ) d e( x o,^) < 4(4,4') = - < W) - ^(40) + (^04) - ^(4)) 

< 4(4,x' 1 ) + 4(4,4) < (145 + 2r ? )4(x ,xi), 
a contradiction. □ 

Lemma 6.26. Assume that e n < I /or a// n and i/iai (xo, xi, X2, X3) zs a (1 + 5) -approximate 
P3 suc/i i/iai (x2,xi,xo) «s r/d £ (xo,xi)-near a tent-type configuration. // max{5, n} < 1/200 f/ien 
(xi,X2,X3) cannot be 75d £ (xQ, x±) near a tent-type configuration. 

Proof. Let (x' 2 , 4>4) be a tent-type configuration that is r/d £ (xq, xi)-near (x2,xi,xo). Suppose 
for contradiction that (xi,X2,X3) is 75d £ (xQ, xi)-near a tent-type configuration (x'{, x 2 , x 3 '). Then 
/i(x' x ) — /i(x' 2 ) = d e (x^,x 2 ), whereas /i(x 2 ) — ^(4) = ^Ki^')' Taking the sum of these two 
inequalities we conclude that 

4(4,4) + d e (4,4) sC 4(4, x") + 4(4,4) < (2n + 145)4(x , xi). 



At the same time, — 2rj — 145) d e (xo,xi) ^ 4(4,4) + d £ (x' 1 ,x' 2 ), which leads to the desired 
contradiction. □ 
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Proof of Lemma 1 6. 231 Since (xo, x±, X2, X3) is a (l+5)-approximate P3, we have x\ G Mid(xo, £2, 5), 
and X2 G Mid(xi, X3, 5). Since the assumptions of Theorem 16.41 hold, we can apply with r] = 75 
Lemmas I6.23[ HT24"! 16.251 16.261 and conclude that either (xo, x\, X2, X3) or (X3, X2, Xi, xo) must be 
355d £ (xo, xi)-near a configuration of type A, B or C. □ 

6.3 Nonembeddability of vertically faithful £? 4 

In what follows we need some standard notation on trees. As before, B n is the complete binary 
tree of height re; the root of B n is denoted by r. Denote by I(B n ) the set of internal vertices of B n , 
i.e., vertices of B n which are not the root or a leaf. For a vertex v in {r} U I{B n ) we denote by 
vo and v\ its children. For a G {0, 1}* (the set of finite sequences of '0' and '1') and a G {0, 1} we 
denote by v aa = (v a ) a . 

The aim of the current section is to prove the following lemma. 

Lemma 6.27. Fix < 5 < 1/400 and let f : B4 — > (B^^d^ be a (1 + 8) -vertically faithful 
embedding. Then the distortion of f satisfies 

dist(/) > 1 



500<5 + e ho 
where h = minr^ h(f(x)). 

The proof of Lemma 16.271 is by a contradiction. By Lemma 16.141 assuming the distortion of 
/ is small, all the <5-forks in the (1 + <5)-vertically faithful embedding must be of types I—IV. By 
exploring the constrains implied by Lemma 16.221 on how those <5-forks can be "stitched" together, 
we reach the conclusion that they are sufficiently severe to force any vertically faithful embedding 
of B4 to have a large contraction, and therefore high distortion. 

Fix / : B4 — > (Boojdfr). For u G -^(-84) we denote by J~(u) the fork in which u is the center 
point, i.e., if v be the parent of u in B4, then 

7-(n) = (/(«), /(«),/(«o),/(«i)). 

We shall assume from now on that / satisfies the assumptions of Lemma l6.271 i.e., that it satisfies ([8|) 
with D = 1 + 5 for some 5 < 1/400 and A > 0. 

Lemma 6.28. Fix u G -B4 with h(u) G {1,2}. If the fork J-{u) is 375\-near a type I or type III 
configuration, then there exists w G I{B/{) satisfying 

d £ (f(wo),f(w 1 )) ^ (170(5 + e ho ) ■ 2A. (90) 

Proof. Let v be the parent of u. Hence, (/(f), f(u), f{uo), f(ui)) is 355(1 + 5)A-near a type / or a 
type III configuration. Assume first that (f(v),f(u),f(uo),f(ui)) is 37(5A-near a type / configu- 
ration. If both (f(uo),f(u),f(v)) and (f(ui),f(u),f(v)) were 37<5A-near a path type configuration 
then by Lemma 16.161 (with 7/ = 375) we would have 

4(/(u ), /(«!)) ^ (3395 + 2e ho )(l + S)X ^ (1705 + e ho ) ■ 2A, (91) 

proving (f90|) with w = u. The same conclusion holds when (f(v),f(u),f(uo),f(ui)) is 375A-near 
a type 177 configuration: in this case without loss of generality (f(v),f(u),f(uo)) is 375A-near 
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a tent-type configuration and (/(«i), f(u), f(v)) is 375A-near a path-type configuration. Using 
Lemma r6.16l as above we would arrive at the conclusion (|9Tj) if (/(«o), /(n), f(v)) were 375A-near a 
path-type type configuration. Thus, in both the type / and type III cases of Lemma 16.281 we may 
assume that (f (v) , f (u) , f (uq)) is 375A-near a tent-type configuration, and that, by Lemma EH 
(f (v) , f (u) , f (uq)) is not 375A-near a path-type configuration, and (f (uo) , f (u) , f (v)) is not 375A- 
near a path-type configuration or a tent-type configuration. 

By Lemma [6.221 (and Table [2]) (f(uQ C ), /(no), f{u), f(v)) must be 35(5(1 + 5)A-near a type B 
configuration for both c G {0,1}. This means that {f (uo c ) , f (uq) , f (u)) are both 355(1 + 5)A- 
near a path-type configuration, and so by Lemma 16.161 (with n = 355(1 + 5)) we deduce that 
4(/Ko), /(uoi)) < (1705 + Eho ) ■ 2A. □ 

Lemma 6.29. Fix n G £?4 lyzf/i /i(n) G {1,2}. If F{u) is 375X-near a type II configuration then 
for both b G {0, 1} either J-(ub) is 995X-near a type II configuration, or d E (f(ubo), /(it&i)) ^ 4005A. 

Proof. Let v be the parent of n. For both c G {0,1} we know that (f (v) , f (u) , f (uq) , f (uq c )) 
is a (1 + 5)-approximate P3, and therefore by Lemma 16.221 either (f(v),f(u),f(uo),f(uo c )) or 
(/(noc), f(uo), f(u), f(v)) is 355(1 + 5)A-near a configuration of type A, B or C. Note that since 
(f(v), f(u), f(uo)) is assumed to be 375A-near a path-type configuration, we rule out the possibility 
that (f(v), f(u), f(uo), f(uQ C )) is 355(1 + 5)A-near a configuration of type C, since otherwise both 
(f (v) , f (u) , f (uo)) and (f(uo),f(u),f(v)) would be 375A-near path-type configurations, contra- 
dicting Lemma [6781 For the same reason we rule out the possibility that (f(uo c ), /(no), f(u), f(v)) 
is 355(1 + 5)A-near a configuration of type A or type B. An inspection of the three remain- 
ing possibilities shows that either (f(u),f(uo),f(uQ C )) is 375A-near a path-type configuration, or 
(f( u Oc),f(uo),f(u)) is 375A-near a tent-type configuration. 
Now, 

• If for both c £ {0, 1} we have that (f(u), f(uo), /(hoc)) is 375A-near a path-type configuration, 
then J-(uo) is 375A-near a type II configuration. 

• If for both c € {0, 1} we have that (f(uo c ), f(uo), f(u)) are 375A-near a tent-type configura- 
tion, then by Lemma 16. 171 we have d £ (f(uoi), /(t*oo)) ^ 4005. 

• By Lemma l"6.18| the only way that (f(u), /(no), f(uao)) could be 375A-near a path type con- 
figuration while at the same time (/(itoi), /(no), f(u)) is 375A-near a tent-type configuration 
(or vice versa), is that .F(no) is 995A-near a type II configuration. □ 

Lemma 6.30. Fix u G -B4 toit/j /i(n) G {1,2}. If J-(u) is 355(1 + 5)X-near a type IV configuration, 
then there exists b G {0, 1} such that J-(v,b) is 375X-near a type II configuration. 

Proof. Let v be the parent of u. Without loss of generality (/(no), /(n), /(f)) is 355(1 + 5)A-near a 
tent- type configuration. By Lemma [6.221 (using Lemma [6.81 to rule out the remaining possibilities), 
this means that for both c G {0, 1} the quadruple (f(uo c ), /(no), f(u), f(v)) is 355(1 + 5) 2 A-near a 
type C configuration, and therefore J-{uq) is 355(1 + 5) 2 A near a type II configuration. □ 

Lemma 6.31. Fix u G B4 with h(u) G {0, 1,2}. If T(uq) and J-(ui) are both 995X-near type a II 
configuration then d e (f(uo),f(ui)) ^ 10005A. 
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Proof. By our assumptions, (/(«), f(uo), /(«oo)) is 995A-near a path type configuration (u', u' Q , u' Q0 ) 
and (f(u),f(ui),f(uio)) is 99<5A-near a path-type configuration (u" ,u",u" ). We may assume 
without loss of generality that h(u") — h(ui) ^ h(u') — h(u' ). We may therefore consider the 
ancestor u* of u' such that h{u') — h(u\) = h{u") — h(u'{), implying in particular that h{u\) ^ h(u' ) 
(recall that u' Q is an ancestor of u', and u'( is ancestor of u"). By Lemma 16.91 we have 



d £ {u[,u'{) ^ d £ (u',u") < 1985A. (92) 



Hence, 



J2U 

/i(u') - h(ul) = d £ (u',ul) ^ d e {u',u") - 1985A > d £ (f(u),f( Ul )) - 3945A > (1 - 3945)A. (93) 
But, we also know that 

h{u') - h(u' ) = d £ {u',u' ) < d £ (f(u),f{u )) + 1985A < (1 + 2005)A. (94) 
It follows from O and (JM^ that d £ (u' Q ,u\) = h(u\) - h(u' ) ^ 6015A. Therefore, 

4(/(«i),/(«o)) ^4(/(n ),«d) + 4K,nD + 4K,O + 4K,/(«i)) = 10005A. □ 



Proof of Lemma 6.21. We may assume that for all u € I{B/{) the fork ^(m) is 355(1 + 5)A-near a 
configuration of type /, II, III, or IV. Indeed, otherwise the proof is complete by Lemma 16.141 
If Tiro) is 355(1 + 5)A-near a type I or type III configuration, then by Lemma T6.28I the proof is 
complete. If F(ro) is 355(1 + <5)A-near a type IV configuration then by Lemma 16.301 there exists 
b € {0, 1} such that F(rob) is 375A-near a type II configuration. It therefore remains to deal with 
the case in which for some u G {ro, rob} the fork J~(u) is 375A-near a type II configuration. Applying 
Lemma f6. 291 either we are done, or both !F(uq) and F{ui) are 995A-near a type 77 configuration, 
but then by Lemma 16.311 the proof of Lemma 16.271 is complete. □ 

6.4 Nonembeddability of binary trees 

We are now in position to complete the proof of Theorem 11.101 

Proof of Theorem \ 1.1 PL Write e n = l/s(n), and e = {e n }^ =0 . Thus {e n }^ =0 is non-increasing, 
{ne n }^L is non-decreasing, and e n ^ 1/4. We can therefore choose the metric space (X,dx) = 
(-Boo, d £ ). The identity embedding of B n into the top n-levels of shows that cx(B n ) ^ s(n). It 
remains to prove the lower bound on cx{B n ). To this end take an arbitrary injection / : B n — > X 
satisfying dist(/) ^ s(n), and we will now prove that 



dist(/) ^ s 



n 



40s(n) 



1 _ Cs(n) logg(n) . 
logn 



By adjusting the constant C in (I95I) . we may assume below that n is large enough, say, n ^ 100. 
Write ho = [n/(A0s(n))\ and define X > i %0 = {x G B^ : h(x) > ho}. We claim that there exists 
a complete binary subtree T C B n of height at least [n/3], such that we have f(T) C X > f lQ . 
Indeed, let hmin = min{/i(x) : x S f(B n )} and h max = max{/i(x) : x G f(B n )}. If /i m i n > ho 
then f(B n ) C X > / lQ , and we can take T = l? n . So assume that /i m j n < /to- Since / is an 
injection it must satisfy /i max > n. Hence ||/||Li P ^ fcmax 2n fcmin ^ !1 2^ :fl ^ |- Since dist(/) ^ s(n) 
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we conclude that ||/ _1 ||Lip ^ 4s(n). It follows that, since diam(A" \ X > f H) ) ^ 2/io, we have 
diam(/~ 1 (X \ X > h )) ^ 8hos(n) ^ n/5. If the top |~n/3] levels of B n are mapped into X > h then 
we are done, so assume that there exists u £ f^{X \ X > f lQ ) of depth at most ^ f / 3~| . In this 
case f~ 1 (X \ Xyhg) must be contained in the first [n/3] + n/5 < 2n/3 — 1 levels of B n , so we can 
take T to be any subtree of B n contained in the last [n/3] levels of B n . 

Fix S G (0,1). By Theorem 11.141 (with t = 4, D = s(n) and £ = S), there exists a universal 
constant k > such that if n ^ s(n) K / 5 then there exists a mapping : £>4 — >■ S n with dist(</>) ^ l+<5 
such that / o (p is a (1 + J)-vertically faithful embedding of B^ into X > h . Choosing 5 = k 1 "^^ , 
by increasing C in (j95j) if necessary, we may assume that 5 < 1/400. Lemma 16.271 then implies 

(1 + 5)dist(/) ^ dist(/o<£) > ' ' 



5005 + e hQ 500k^^ 



+ 



logn 1 s(|n/(40s(n))J) 

The deduction of ([7]) from is a simple exercise: if s(n) = o(logn/ log logn) then we have 
(s(n) log s(n))/ log n = o(l). The desired claim will then follow once we check that 

s(\n/U0s(n))\) , . 

limsup VL /v v = 1. (96) 
n— >oo <s(n) 

Indeed, if (|96|) failed then there would exist e$ € (0, 1) and no € N such that for all n ^ no, 

s(Ln/lognJ) < s(\n/(4Qs(n))\) < (l-e )a(n). (97) 

Iterating ([97]) . it would follow that s(rij) n^ 1 '' for some subsequence a contradiction. □ 

Proof of Theorem \1.12L The proof is identical to the above argument: all one has to notice is 
that when s(n) = D for all n £ N the resulting metric <i e on 5^ is /^-equivalent to the original 
shortest path metric on B^. In this case, if cx(B n ) ^ D — e then the bound (|95p implies that 
n < D CE>2 / £ . □ 



7 Discussion and open problems 

A very interesting question that arises naturally from Theorem 11.31 and is also a part of the Ribe 
program, is finding a metric characterization of (/-smoothness. A Banach space (X, || ■ \\x) is called 
g-smooth if it admits an equivalent norm ||| ■ ||| such that there is a constant S > satisfying: 

n . _ Y llk + ylll + IH^-ylH . , . „ q 

\\\x\\\ = 1 A y G X ==>■ ^ 1 + S'|||y||| y . 

A Banach space X is p-convex if and only if its dual space X* is q-smooth, where - + - = 1 |17j . 
It is known that a Banach space X is p-convex for some p < oo (i.e., super reflexive^ if and only if 
it is g-smooth for some q > 1 (this follows from [TUl I29j ). Hence Bourgain's metric characterization 
of superreflexivity can be viewed as a statement about uniform smoothness as well. However, we 
still lack a metric characterization of the more useful notion of ^-smoothness. Trees are natural 
candidates for finite metric obstructions to p-convexity, but it is unclear what would be the possible 
finite metric witnesses to the "non-q-smoothness" of a metric space. 

.ff-trees are geometric objects that are quite simple combinatorially, yet as we have seen, they 
have interesting bi-Lipschitz properties. It would therefore be of interest to investigate the geometry 
of H-trees for its own right. In particular, what is the L\ distortion of an H-tree? How close can 
an H-tiee be to a metric of negative type? 
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